Abstract

We consider a multidimensional Itô semimartingale regularly sampled on $[0,t]$ at high
frequency $1/\Delta_n$, with $\Delta_n$ going to zero. The goal of this paper is to provide an estimator
for the integral over $[0,t]$ of a given function of the volatility matrix. To approximate
the integral, we simply use a Riemann sum based on local estimators of the pointwise
volatility. We show that although the accuracy of the pointwise estimation is essentially
$\Delta_n^{1/4}$, this procedure reaches the parametric rate $\Delta_n^{1/2}$. However, the limiting process
in the central limit theorem exhibits a bias. After a suitable bias correction, we obtain
an unbiased central limit theorem for our estimator and show that it is asymptotically
efficient within some classes of sub models.
1 Introduction

Let $X$ be a semimartingale, which is observed at discrete times $i\Delta_n$ for $i = 0, 1, \dots$, over
a finite time interval $[0,T]$, with a discretization mesh $\Delta_n$ which is small and eventually
goes to $0$ (high-frequency setting). One of the main problems encountered in practice is the
estimation of the integrated (squared) volatility (in finance terms), or equivalently of the
continuous part of the quadratic variation $[X,X]_t$.

By now, this is a well understood problem, at least when $X$ is an Itô semimartingale.
For example, in the continuous one-dimensional case, if $X$ takes the form

$$X_t = X_0 + \int_0^t b_s\,ds + \int_0^t \sigma_s\,dW_s,$$

the approximate quadratic variation $\sum_{i=1}^{[t/\Delta_n]} (X_{i\Delta_n} - X_{(i-1)\Delta_n})^2$, which of course converges to
$[X,X]_t = \int_0^t \sigma_s^2\,ds$, enjoys a Central Limit Theorem (CLT): the difference between these two
processes, normalized by $1/\sqrt{\Delta_n}$, converges stably in law to a limit which is, conditionally on
$X$, a continuous Gaussian martingale with quadratic variation (equivalently, with variance)
twice the so-called "quarticity", that is $2\int_0^t \sigma_s^4\,ds$.

Although later we consider a much more general framework, allowing $X$ to be multidimensional
and with jumps, in the introduction we pursue the discussion in this special
one-dimensional continuous case. In various statistical problems one needs to estimate not
only the quarticity, but functionals of the form

$$V(g)_t = \int_0^t g(c_s)\,ds, \qquad \text{where } c_s = \sigma_s^2,$$

for relatively general test functions $g$, and to derive associated CLTs, see [6]; notice that
we plug in the "spot" squared volatility $c_t$ rather than $\sigma_t$, since in any case it is impossible
to determine the sign of $\sigma_t$ on the basis of the observation of the path $t \mapsto X_t$. The case
$g(x) = x$ corresponds to the usual integrated volatility, and $g(x) = x^2$ to the quarticity.

Toward this aim, two methods are currently at hand: 

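1. The first one is available if $g(x) = E(f(U(x)_1, \dots, U(x)_k))$ for all $x \ge 0$, where the
$U(x)_j$'s are independent $\mathcal N(0,x)$ variables and $f$ is a continuous function on $\mathbb R^k$, of
polynomial growth. Then we know that

$$U^n(f)_t = \Delta_n \sum_{i=1}^{[t/\Delta_n]-k+1} f\Big(\frac{\Delta_i^n X}{\sqrt{\Delta_n}}, \dots, \frac{\Delta_{i+k-1}^n X}{\sqrt{\Delta_n}}\Big), \quad \text{where } \Delta_i^n X = X_{i\Delta_n} - X_{(i-1)\Delta_n}, \tag{1.1}$$

converges to $V(g)_t$ in probability, and if $f$ is sufficiently smooth the rate of convergence is $1/\sqrt{\Delta_n}$, and
in the associated CLT the limiting conditional variance is $\int_0^t F(c_s)\,ds$ for a suitable
function $F$.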

2. The second one consists in using estimators for the spot volatility and approximating
the integral $V(g)_t$ by Riemann sums, in which the spot volatility is replaced by its
estimator; that is, we set

$$V^n(g)_t = \Delta_n \sum_{i=1}^{[t/\Delta_n]-k_n+1} g(\hat c_i^n), \quad \text{where } \hat c_i^n = \frac{1}{k_n\Delta_n} \sum_{j=0}^{k_n-1} (\Delta_{i+j}^n X)^2, \tag{1.2}$$

for an arbitrary sequence $k_n$ of integers such that $k_n \to \infty$ and $k_n\Delta_n \to 0$. Then one
knows that $V^n(g)_t \stackrel{\mathbb P}{\longrightarrow} V(g)_t$ (when $g$ is continuous and of polynomial growth). But
so far nothing is known about the rate of convergence of these estimators.

The first method is quite powerful and gives optimal rates, but the special form of $g$
puts strong constraints on $g$ (for example, it is $C^\infty$ on $(0,\infty)$, and much more). To tell the
truth, by far the most useful test functions $g$ are the powers $g_p(x) = x^p$ (recall that $x \ge 0$
here) for $p > 0$, which are associated as above with $f_p(x) = |x|^{2p}/m_{2p}$, where $m_q$ is the $q$th
absolute moment of $\mathcal N(0,1)$. Nevertheless, some functions $g$ of interest are not, or not in an
obvious way, of this form or, more generally, linear combinations of functions of this form.
This is particularly true in the multivariate case, with $X$ being $d$-dimensional and thus $U$
above as well: for an arbitrary (smooth) function $g$ on the set $\mathcal M_d^+$ of all $d \times d$ symmetric
nonnegative matrices, it is rather a difficult task in practice to find an integer $k \ge 1$ and
a function $f$ on $(\mathbb R^d)^k$ such that, for all $x \in \mathcal M_d^+$, we have $g(x) = E(f(U(x)_1, \dots, U(x)_k))$,
where again the $U(x)_j$'s are ($d$-dimensional) i.i.d. $\mathcal N(0,x)$.

In addition, this first method does not provide efficient estimation in general. To see that,
consider the toy example $X_t = \sigma W_t$, where $\sigma$ is a constant, and $c = \sigma^2$, and $\Delta_n = \frac1n$ and
$T = 1$. We thus observe the increments $\Delta_i^n X$ for $i = 1, \dots, n$, or equivalently the $n$ variables
$Y_i = \Delta_i^n X/\sqrt{\Delta_n}$. These variables are i.i.d. $\mathcal N(0,c)$, so the asymptotically best estimators for
$c$ (efficient in all possible senses, and also the MLE) are $\hat c_n = \frac1n \sum_{i=1}^n (Y_i)^2 = \sum_{i=1}^n (\Delta_i^n X)^2$,
with convergence rate $\sqrt n$ and asymptotic variance $2c^2$. If instead one wants to estimate
$c^p$ for some $p \ne 1$ in $(0,\infty)$, one can use $\hat c(p)_n = \frac{1}{n\,m_{2p}} \sum_{i=1}^n |Y_i|^{2p} = \frac{\Delta_n^{1-p}}{m_{2p}} \sum_{i=1}^n |\Delta_i^n X|^{2p}$,
and the ordinary Central Limit Theorem tells us that the rate of convergence is again $\sqrt n$,
and the asymptotic variance is $\frac{m_{4p} - m_{2p}^2}{m_{2p}^2}\, c^{2p}$: this is exactly what the first method above does.

But this is not optimal, the asymptotically optimal estimators being $(\hat c_n)^p$ (the MLE again),
with rate $\sqrt n$ and asymptotic variance $2p^2 c^{2p}$, smaller than the previous one when $p \ne 1$.
Now, taking $(\hat c_n)^p$ is exactly what the second method (1.2) does.
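
The comparison of the two asymptotic variances in the toy model can be checked numerically. The following is only an illustrative sketch and not part of the original argument: the function names are hypothetical, and it relies on the standard closed form $m_q = 2^{q/2}\,\Gamma((q+1)/2)/\sqrt{\pi}$ for the absolute moments of $\mathcal N(0,1)$.

```python
import math


def abs_moment(q: float) -> float:
    """q-th absolute moment m_q of a standard normal variable."""
    return 2.0 ** (q / 2.0) * math.gamma((q + 1.0) / 2.0) / math.sqrt(math.pi)


def avar_first_method(p: float, c: float = 1.0) -> float:
    """Asymptotic variance of the power-variation estimator of c^p (first method)."""
    m2p = abs_moment(2.0 * p)
    m4p = abs_moment(4.0 * p)
    return (m4p - m2p ** 2) / m2p ** 2 * c ** (2.0 * p)


def avar_second_method(p: float, c: float = 1.0) -> float:
    """Asymptotic variance of the plug-in estimator (c_hat_n)^p (second method)."""
    return 2.0 * p ** 2 * c ** (2.0 * p)


if __name__ == "__main__":
    # The two variances agree for p = 1 and differ otherwise.
    for p in (0.5, 1.0, 2.0, 3.0):
        print(p, avar_first_method(p), avar_second_method(p))
```

For $p = 2$ (the quarticity) the two functions return $\frac{32}{3}c^4$ and $8c^4$ respectively.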

The aim of this paper is to develop the second method, and in particular to provide
a Central Limit Theorem, with the rate $1/\sqrt{\Delta_n}$, and with an asymptotic variance always
smaller than if one uses the first method. About efficiency, and despite the title of the paper,
we do not really examine the question in the general non-parametric or semi-parametric
setting assumed below, since even for the simpler problem of estimating integrated volatility
the concept of efficiency in this setting is not well established so far. Instead, we will term
as "efficient" a procedure which is efficient in the usual sense for the sub-model consisting
in the toy model $X_t = \sigma W_t$ above, or for more general sub-models for which the concept
of efficiency is well defined: for example our estimators will be efficient in the sense of the
convolution theorem, for the Markov-type models recently studied by Clément, Delattre and
Gloter in [2].

This will be done in the multivariate setting and when $X$ possibly has jumps (upon
suitably truncating the increments in (1.2) if it is the case, in the spirit of Oil]), and under
the additional assumption that $c_t$ itself is an Itô semimartingale. We should mention right
away that for the CLT one needs to add to $V^n(g)$ a suitable de-biasing term, and also that
when $X$ jumps one needs in addition the jumps of $X$ to be summable, which is exactly the
same assumption under which the truncated versions of $U^n(f)$ in (1.1) converge with rate
$1/\sqrt{\Delta_n}$.

The paper is organized as follows: Section 2 is devoted to presenting the assumptions.
Results are given in Section 3, and all proofs are gathered in Section 4.

2 Setting and Assumptions 

The underlying process $X$ is $d$-dimensional, and observed at the times $i\Delta_n$ for $i = 0, 1, \dots$,
within a fixed interval of interest $[0,t]$. For any process $Y$ we use the notation $\Delta_i^n Y$ defined in
(1.1) for the increment over the $i$th observation interval. We assume that the sequence $\Delta_n$
goes to $0$. The precise assumptions on $X$ are as follows:

First, $X$ is an Itô semimartingale on a filtered space $(\Omega, \mathcal F, (\mathcal F_t)_{t\ge0}, \mathbb P)$. It can be written
in its Grigelionis form, as follows, using a $d$-dimensional Brownian motion $W$ and a Poisson
random measure $\mu$ on $\mathbb R_+ \times E$, where $E$ is an auxiliary Polish space, with the (non-random)
intensity measure $\nu(dt,dz) = dt \otimes \lambda(dz)$ for some $\sigma$-finite measure $\lambda$ on $E$:

$$X_t = X_0 + \int_0^t b_s\,ds + \int_0^t \sigma_s\,dW_s + \int_0^t\!\!\int_E \delta(s,z)\,1_{\{\|\delta(s,z)\|\le1\}}\,(\mu-\nu)(ds,dz) + \int_0^t\!\!\int_E \delta(s,z)\,1_{\{\|\delta(s,z)\|>1\}}\,\mu(ds,dz). \tag{2.1}$$

This is a vector-type notation: the process $b_t$ is $\mathbb R^d$-valued optional, the process $\sigma_t$ is $\mathbb R^d \otimes \mathbb R^d$-valued
optional, and $\delta = \delta(\omega,t,z)$ is a predictable $\mathbb R^d$-valued function on $\Omega \times \mathbb R_+ \times E$.

The spot volatility process $c_t = \sigma_t\sigma_t^*$ ($*$ denotes transpose) takes its values in the set
$\mathcal M_d^+$ of all nonnegative symmetric $d \times d$ matrices. We suppose that $c_t$ is again an Itô
semimartingale, which can be written as

$$c_t = c_0 + \int_0^t \tilde b_s\,ds + \int_0^t \tilde\sigma_s\,dW_s + \int_0^t\!\!\int_E \tilde\delta(s,z)\,1_{\{\|\tilde\delta(s,z)\|\le1\}}\,(\mu-\nu)(ds,dz) + \int_0^t\!\!\int_E \tilde\delta(s,z)\,1_{\{\|\tilde\delta(s,z)\|>1\}}\,\mu(ds,dz), \tag{2.2}$$

with the same $W$ and $\mu$ as in (2.1). This is indeed not a restriction: if $X$ and $c$ are two Itô
semimartingales, we have a representation as above for the pair $(X,c)$ and, if the dimension
of $W$ exceeds the dimension of $X$, one can always add fictitious components to $X$, arbitrarily
set to $0$, so that the dimensions of $X$ and $W$ agree. Notice also that since $c_t$ is symmetric,
in (2.2) we could restrict our attention to the components $c_t^{jl}$ with $1 \le j \le l \le d$, for
example. From a notational viewpoint, however, it is simpler to model the full matrix $c_t$, even
though it is redundant.

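In (2.2), $\tilde b$ and $\tilde\sigma$ are optional and $\tilde\delta$ is as $\delta$; moreover $\tilde b$ and $\tilde\delta$ are $\mathbb R^{d^2}$-valued. Finally, we
need the spot volatility of the volatility and the "spot covariation" of the continuous martingale
parts of $X$ and $c$, which are

$$\tilde c_t^{\,ij,kl} = \sum_{m=1}^d \tilde\sigma_t^{\,ij,m}\,\tilde\sigma_t^{\,kl,m}, \qquad \tilde c_t^{\,\prime\,i,jk} = \sum_{l=1}^d \sigma_t^{\,il}\,\tilde\sigma_t^{\,jk,l}.$$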



The precise assumptions on the coefficients are as follows, with $r$ a real in $[0,2]$ and
$\|\cdot\|$ denoting the Euclidean norm on any finite-dimensional linear space:

Assumption (A-r): There are a sequence $(J_n)$ of nonnegative bounded $\lambda$-integrable functions
on $E$ and a sequence $(\tau_n)$ of stopping times increasing to $\infty$, such that

$$t < \tau_n(\omega) \ \Longrightarrow\ \|\delta(\omega,t,z)\|^r \wedge 1 + \|\tilde\delta(\omega,t,z)\|^2 \wedge 1 \le J_n(z). \tag{2.3}$$

Moreover the processes $b_t$, $\tilde b_t$, $\tilde c_t$ and $\tilde c'_t$ are càdlàg, and the maps $t \mapsto \delta(\omega,t,z)$ and $t \mapsto \tilde\delta(\omega,t,z)$
are càglàd (recall that $\delta$ and $\tilde\delta$ should be predictable). □

The bigger $r$ is, the weaker Assumption (A-r) is, and when (A-0) holds the process $X$
has finitely many jumps on each finite interval. The part of (A-r) concerning the jumps of
$X$ implies that $\sum_{s\le t} \|\Delta X_s\|^r < \infty$ a.s. for all $t < \infty$, and it is in fact "almost" implied by
this property.

Remark 2.1 Some degree of regularity in time seems to be needed for $b, \tilde b, \tilde c, \tilde c', \delta, \tilde\delta$. For
$b, \tilde b, \tilde c, \tilde c'$ one could replace càdlàg by càglàd (or replace for instance $b_t$ by $b_{t-}$), since only the
integrals of these four processes with respect to Lebesgue measure are relevant.



3 The Results 

3.1 A First Central Limit Theorem. 

For defining the estimators of the spot volatility, we first choose a sequence $k_n$ of integers
which satisfies, as $n \to \infty$:

$$k_n \sim \frac{\theta}{\sqrt{\Delta_n}}, \qquad \theta \in (0,\infty), \tag{3.1}$$

and a sequence $u_n$ in $(0,\infty]$. The $\mathcal M_d^+$-valued variables $\hat c_i^n$ are defined, componentwise, as

$$\hat c_i^{\,n,lm} = \frac{1}{k_n\Delta_n} \sum_{j=0}^{k_n-1} \Delta_{i+j}^n X^l\, \Delta_{i+j}^n X^m\, 1_{\{\|\Delta_{i+j}^n X\| \le u_n\}}, \tag{3.2}$$

and they implicitly depend on $\Delta_n$, $k_n$, $u_n$.

One knows that $\hat c^n_{[t/\Delta_n]} \stackrel{\mathbb P}{\longrightarrow} c_t$ for any $t$, and there is an associated Central Limit Theorem
under (A-2), with rate $1/\Delta_n^{1/4}$: the choice (3.1) is optimal, in the sense that it allows us to
have the fastest possible rate by a balance between the involved "statistical error", which
is of order $1/\sqrt{k_n}$, and the variation of $c_t$ over the interval $[t, t + k_n\Delta_n]$, which is of order
$\sqrt{k_n\Delta_n}$ because $c_t$ is an Itô semimartingale (and even when it jumps), see [HE].
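
To make the definition of the local estimators concrete, here is a minimal sketch for the one-dimensional case $d = 1$. It is an illustration under stated assumptions rather than the authors' implementation: the function name `spot_variances` and its arguments are hypothetical, indices are 0-based (entry `i` of the result corresponds to $\hat c_{i+1}^n$), and `u_n = math.inf` means no truncation.

```python
import math


def spot_variances(increments, delta_n, k_n, u_n=math.inf):
    """Truncated local variance estimators in the spirit of (3.2), for d = 1.

    out[i] averages the squared increments increments[i : i + k_n],
    dropping those whose absolute value exceeds u_n.
    """
    n = len(increments)
    if not 1 <= k_n <= n:
        raise ValueError("k_n must satisfy 1 <= k_n <= len(increments)")
    # Squared increments, with truncated ones replaced by zero.
    squares = [x * x if abs(x) <= u_n else 0.0 for x in increments]
    window = sum(squares[:k_n])
    out = [window / (k_n * delta_n)]
    # Slide the window one observation at a time.
    for i in range(1, n - k_n + 1):
        window += squares[i + k_n - 1] - squares[i - 1]
        out.append(window / (k_n * delta_n))
    return out


if __name__ == "__main__":
    # Constant increments of size sqrt(delta_n) correspond to c = 1.
    print(spot_variances([0.1] * 10, delta_n=0.01, k_n=4))
```

With $n$ increments the function returns $n - k_n + 1$ values, matching the range $1 \le i \le [t/\Delta_n] - k_n + 1$ over which the estimators are computable.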

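By Theorem 9.4.1 of [6], one also knows that

$$V(g)^n_t := \Delta_n \sum_{i=1}^{[t/\Delta_n]-k_n+1} g(\hat c_i^n) \ \stackrel{\text{u.c.p.}}{\Longrightarrow}\ V(g)_t := \int_0^t g(c_s)\,ds \tag{3.3}$$

(convergence in probability, uniform over each compact interval; by convention $\sum_{i=a}^b v_i = 0$
whenever $b < a$), as soon as the function $g$ on $\mathcal M_d^+$ is continuous with $|g(x)| \le K(1 + \|x\|^p)$
for some constants $K, p$, and under either one of the following three conditions:

$$\begin{array}{l}
\bullet\ \text{(A-0) holds, } X \text{ is continuous, } u_n/\Delta_n^{\varepsilon} \to \infty \text{ for some } \varepsilon < \tfrac12 \text{ (for example } u_n \equiv \infty\text{)} \\
\bullet\ \text{(A-2) holds and } p < 1 \text{ and } u_n \asymp \Delta_n^{\varpi} \text{ for some } \varpi \in \big(0, \tfrac12\big) \\
\bullet\ \text{(A-}r\text{) holds for some } r \in [0,2) \text{ and } p \ge 1 \text{ and } u_n \asymp \Delta_n^{\varpi} \text{ for some } \varpi \in \big[\tfrac{p-1}{2p-r}, \tfrac12\big).
\end{array} \tag{3.4}$$

Notice the upper limit in the definition (3.3) of $V(g)^n_t$: this is to ensure that $V(g)^n_t$ is
actually computable from the observations up to the time horizon $t$. Note also that when
$X$ is continuous, the truncation in (3.2) is useless: one may use (3.2) with $u_n \equiv \infty$, which
reduces to (1.2) in the one-dimensional case.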
Now, we want to determine at which rate the convergence (3.3) takes place. This amounts
to proving an associated Central Limit Theorem. Under the restriction $r < 1$ and an
appropriate choice of the truncation levels $u_n$, such a CLT is available for $V(g)^n$, with the rate
$1/\sqrt{\Delta_n}$, but the limit exhibits a bias term.

As a matter of fact, the bias results from two factors. The first one is due to a border
effect: the formula giving $V(g)^n_t$ contains $[t/\Delta_n] - k_n + 1$ summands only, whereas the
natural (unfeasible) approximation $\Delta_n \sum_{i=1}^{[t/\Delta_n]} g(c_{(i-1)\Delta_n})$ contains $[t/\Delta_n]$ summands. The
sum of the lacking $k_n - 1$ summands is of order of magnitude $(k_n - 1)\Delta_n$, which goes to $0$
and thus does not impair consistency, but it creates an obvious bias after normalization by
$1/\sqrt{\Delta_n}$. Removing this source of bias is straightforward: since $g(c_s)$ is "under-represented"
when $s$ is close to $0$ or to $t$, we replace $V(g)^n_t$ by

$$V'(g)^n_t = \Delta_n \sum_{i=1}^{[t/\Delta_n]-k_n+1} g(\hat c_i^n) + \frac{(k_n-1)\Delta_n}{2} \Big( g(\hat c_1^n) + g\big(\hat c^n_{[t/\Delta_n]-k_n+1}\big) \Big). \tag{3.5}$$

Of course, other weighted averages of $g(\hat c_i^n)$ for $i$ close to $0$ or to $[t/\Delta_n] - k_n + 1$ would be
possible.
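
The border correction can be sketched in a few lines for $d = 1$. This is a hedged illustration only: the function names are hypothetical, and `spot` is assumed to be the list of local estimators $\hat c_1^n, \dots, \hat c^n_{[t/\Delta_n]-k_n+1}$ computed beforehand.

```python
def riemann_sum(spot, delta_n, g):
    """Uncorrected Riemann sum: delta_n times the sum of g over the local estimators."""
    return delta_n * sum(g(c) for c in spot)


def border_corrected_sum(spot, delta_n, k_n, g):
    """Riemann sum plus the border term (k_n - 1) * delta_n / 2 * (g(first) + g(last))."""
    if not spot:
        raise ValueError("at least one local estimator is required")
    border = 0.5 * (k_n - 1) * delta_n * (g(spot[0]) + g(spot[-1]))
    return riemann_sum(spot, delta_n, g) + border


if __name__ == "__main__":
    # 100 observations with delta_n = 0.01 and k_n = 10 give 91 local estimators.
    spot = [2.0] * 91
    print(riemann_sum(spot, 0.01, lambda x: x * x))
    print(border_corrected_sum(spot, 0.01, 10, lambda x: x * x))
```

When the local estimators are all equal to a constant $c$ and $[t/\Delta_n]\Delta_n = t$, the corrected sum returns $t\,g(c)$, whereas the uncorrected one falls short by $(k_n-1)\Delta_n\,g(c)$.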

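The second source of bias is more subtle, as seen from its specific form given in our first
theorem below. Below, $g$ is a smooth function on $\mathcal M_d^+$, and the two first partial derivatives
are denoted as $\partial_{jk} g$ and $\partial^2_{jk,lm} g$, since any $x \in \mathcal M_d^+$ has $d^2$ components $x^{jk}$. The family of
all partial derivatives of order $j$ is simply denoted as $\partial^j g$.

Theorem 3.1 Assume (A-r) for some $r < 1$. Let $g$ be a function on $\mathcal M_d^+$ such that

$$\|\partial^j g(x)\| \le K(1 + \|x\|^{p-j}), \qquad j = 0, 1, 2, 3 \tag{3.6}$$

for some constants $K > 0$, $p \ge 3$. Either suppose that $X$ is continuous and $u_n/\Delta_n^{\varepsilon} \to \infty$
for some $\varepsilon < 1/2$ (for example, $u_n \equiv \infty$, so there is no truncation at all), or suppose that

$$u_n \asymp \Delta_n^{\varpi}, \qquad \frac{2p-1}{2(2p-r)} \le \varpi < \frac12. \tag{3.7}$$

Then we have the finite-dimensional (in time) stable convergence in law

$$\frac{1}{\sqrt{\Delta_n}} \big( V'(g)^n_t - V(g)_t \big) \ \stackrel{\mathcal L_f\text{-}s}{\longrightarrow}\ A^{c,1}_t + A^{c,2}_t + A^{d}_t + Z_t, \tag{3.8}$$

where $Z$ is a process defined on an extension $(\tilde\Omega, \tilde{\mathcal F}, (\tilde{\mathcal F}_t)_{t\ge0}, \tilde{\mathbb P})$ of $(\Omega, \mathcal F, (\mathcal F_t)_{t\ge0}, \mathbb P)$, which
conditionally on $\mathcal F$ is a continuous centered Gaussian martingale with variance

$$E\big((Z_t)^2 \mid \mathcal F\big) = \sum_{j,k,l,m=1}^d \int_0^t \partial_{jk} g(c_s)\, \partial_{lm} g(c_s)\, \big( c_s^{jl} c_s^{km} + c_s^{jm} c_s^{kl} \big)\, ds, \tag{3.9}$$

and where

$$\begin{aligned}
A^{c,1}_t &= \frac{1}{2\theta} \sum_{j,k,l,m=1}^d \int_0^t \partial^2_{jk,lm} g(c_s)\, \big( c_s^{jl} c_s^{km} + c_s^{jm} c_s^{kl} \big)\, ds \\
A^{c,2}_t &= -\frac{\theta}{12} \sum_{j,k,l,m=1}^d \int_0^t \partial^2_{jk,lm} g(c_s)\, \tilde c_s^{\,jk,lm}\, ds \\
A^{d}_t &= \theta \sum_{s \le t} G(c_{s-}, \Delta c_s),
\end{aligned} \tag{3.10}$$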
and

$$G(x,y) = \int_0^1 \big( g(x + wy) - (1-w)\,g(x) - w\,g(x+y) \big)\, dw. \tag{3.11}$$

(Note that $|G(x,y)| \le K(1 + \|x\| + \|y\|)^{p-2}\,\|y\|^2$, so the sum defining $A^{d}_t$ is absolutely convergent.)
When $c_t$ is continuous we have $A^{d} = 0$ and it can be proved that the convergence
(3.8) also takes place in the "functional" sense, but it is impossible to have functional
convergence when $c_t$ has jumps because the limit in (3.8) has jumps, whereas the jumps of the
left side are vanishing as $n \to \infty$: this prevents the convergence for the Skorokhod topology.

Remark 3.2 The bias has three parts:

1) The first part $A^{c,1}$ is continuous in time and is present even for the toy model $X_t = \sqrt{c}\,W_t$
with $c$ a constant and $\Delta_n = \frac1n$ and $T = 1$. In this simple case it can be interpreted as
follows: instead of taking the "optimal" $g(\hat c_n)$ for estimating $g(c)$, with $\hat c_n = \sum_{i=1}^n (\Delta_i^n X)^2$,
one takes $\frac1n \sum_i g(\hat c_i^n)$ with a "local" estimator $\hat c_i^n$ of $c$. This adds a statistical error which
results in a bias. Note that, even in the general case, this bias would disappear, were we
taking in (3.1) the (forbidden) value $\theta = \infty$ (with still $k_n\Delta_n \to 0$), at the expense of a slower
rate of convergence.

2) The second and third parts $A^{c,2}$ and $A^{d}$ are respectively continuous and purely
discontinuous, due to the continuous part and to the jumps of the volatility process $c_t$ itself.
These two biases disappear if we take $\theta = 0$ in (3.1) (with still $k_n \to \infty$), again a forbidden
value, and again at the expense of a slower rate of convergence.

The only test function $g$ for which all these biases disappear is the identity $g(x) = x$. This is
because, in this case, and up to border terms, $V'(g)^n_t$ is nothing but the realized quadratic
variation itself and the spot estimators actually merge together and disappear as such.

Remark 3.3 Observe that (3.7) implies $r < 1$. This restriction is not a surprise, since
one needs $r < 1$ in order to estimate the integrated volatility by the (truncated) realized
volatility, with a rate of convergence $1/\sqrt{\Delta_n}$. When $r = 1$ it is likely that the CLT still
holds for an appropriate choice of the sequence $u_n$, and with another additional bias, see
e.g. [7] for a slightly different context. Here we leave this borderline case aside.

3.2 Estimation of the Bias. 

Now we proceed to "remove" the bias, which means subtracting consistent estimators for 
the bias from V'{g)f. First, A^^^'^ = ^ V{h) for the test function h given on by 




(3.12) 



j,k,l,m=l 



Therefore 




1 



(3.13) 



(One can also use y(/i)", of course.) 






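Secondly, the term $A^{c,2}_t$ involves the volatility of the volatility, for which estimators
have been provided in the one-dimensional case by M. Vetter in [8]; namely, if $d = 1$ and
under suitable technical assumptions (slightly stronger than here), plus the continuity of $X_t$
and $c_t$, he proves that

$$\frac{3}{2k_n} \sum_{j=1}^{[t/\Delta_n]-2k_n+1} \big( \hat c^n_{j+k_n} - \hat c^n_j \big)^2$$

converges to $\int_0^t \big( \tilde c_s + \frac{6}{\theta^2} (c_s)^2 \big)\, ds$. Of course, we need to modify this estimator here, in order
to include the function $\partial^2 g$ in the limit and account for the possibilities of having $d \ge 2$ and
having jumps in $X$. We propose to take

$$A^{n,c,2}_t = -\frac{\sqrt{\Delta_n}}{8} \sum_{i=1}^{[t/\Delta_n]-2k_n+1} \sum_{j,k,l,m=1}^d \partial^2_{jk,lm} g(\hat c_i^n)\, \big( \hat c^{\,n,jk}_{i+k_n} - \hat c^{\,n,jk}_i \big) \big( \hat c^{\,n,lm}_{i+k_n} - \hat c^{\,n,lm}_i \big). \tag{3.14}$$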
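
For $d = 1$ the statistic proposed above reduces to a single sum of weighted squared differences of local estimators taken $k_n$ indices apart. The sketch below is illustrative only: the function name is hypothetical, `spot` is assumed to hold $\hat c_1^n, \dots, \hat c^n_{[t/\Delta_n]-k_n+1}$ (0-based in the code), `d2g` stands for the second derivative $g''$, and the normalizing constant $-\sqrt{\Delta_n}/8$ is the one used in the display above.

```python
import math


def vol_of_vol_debias(spot, delta_n, k_n, d2g):
    """One-dimensional version of the vol-of-vol de-biasing statistic.

    Sums d2g(spot[i]) * (spot[i + k_n] - spot[i])**2 over all admissible i
    and multiplies by -sqrt(delta_n) / 8.
    """
    if k_n < 1:
        raise ValueError("k_n must be a positive integer")
    total = 0.0
    # len(spot) - k_n terms, so that spot[i + k_n] is always available.
    for i in range(len(spot) - k_n):
        diff = spot[i + k_n] - spot[i]
        total += d2g(spot[i]) * diff * diff
    return -math.sqrt(delta_n) / 8.0 * total


if __name__ == "__main__":
    # g(x) = x^2, hence g''(x) = 2.
    print(vol_of_vol_debias([0.0, 1.0, 2.0, 3.0], 0.04, 2, lambda x: 2.0))
```

The statistic vanishes when the local estimators are constant, and it is nonpositive whenever $g$ is convex.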

When $X$ and $c$ are continuous one may expect the convergence to $A^{c,2}_t - \frac12 A^{c,1}_t$ (observe
that $\frac{\sqrt{\Delta_n}}{8} \sim \frac{\theta}{12} \cdot \frac{3}{2k_n}$), and one may expect the same when $X$ jumps, because in (3.2) the
truncation basically eliminates the jumps of $X$, and when $c$ is still continuous. In contrast,
when $c$ jumps, the limit should rather be related to the "full" quadratic variation of $c$, and
indeed we have:

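Theorem 3.4 Under the assumptions of Theorem 3.1, for all $t > 0$ we have

$$A^{n,c,2}_t \ \stackrel{\mathbb P}{\longrightarrow}\ A^{c,2}_t - \frac12 A^{c,1}_t + A'^{\,d}_t, \tag{3.15}$$

where

$$A'^{\,d}_t = \theta \sum_{s \le t} G'(c_{s-}, \Delta c_s) \tag{3.16}$$

and

$$G'(x,y) = -\frac18 \sum_{j,k,l,m} \int_0^1 \big( \partial^2_{jk,lm} g(x) + \partial^2_{jk,lm} g(x + (1-w)y) \big)\, w^2\, y^{jk}\, y^{lm}\, dw. \tag{3.17}$$

At this stage, it remains to find consistent estimators for $A^{d}_t - A'^{\,d}_t$, which has the form

$$A^{d}_t - A'^{\,d}_t = \theta \sum_{s \le t} G''(c_{s-}, \Delta c_s), \qquad \text{where } G'' = G - G'.$$

More generally, we aim at estimating

$$V(F)_t = \sum_{s \le t} F(c_{s-}, \Delta c_s), \tag{3.18}$$

at least when the function $F$ on $\mathcal M_d^+ \times \mathcal M_d$, where $\mathcal M_d$ is the set of all $d \times d$ matrices, is
continuous and $|F(x,y)| \le K\|y\|^2$ uniformly in $x$ within any compact set, as is the function $G''$ above.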
The solution to this problem is not as simple as it might appear at first glance. We first
truncate from below, taking any sequence $u'_n$ of truncation levels satisfying

u' 1 
< ^ 0, ^ for some w' G (O, -) (3.19) 

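Second, we resort to the following trick. Since $\hat c_i^n$ is "close" to the average of $c_t$ over the
interval $(i\Delta_n, (i + k_n)\Delta_n]$, we (somehow wrongly) pretend that, for all $j$:

$$\exists\, s \in ((j-1)k_n\Delta_n,\, jk_n\Delta_n] \ \text{with}\ \|\Delta c_s\| > u'_n \iff \|\hat c^n_{jk_n} - \hat c^n_{(j-2)k_n}\| > u'_n,$$

and that in this case

$$\Delta c_s \approx \hat c^n_{jk_n} - \hat c^n_{(j-2)k_n}, \qquad \|\hat c^n_{(j-1)k_n} - \hat c^n_{(j-3)k_n}\| \vee \|\hat c^n_{(j+1)k_n} - \hat c^n_{(j-1)k_n}\| < \|\hat c^n_{jk_n} - \hat c^n_{(j-2)k_n}\|. \tag{3.20}$$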

The condition (3.19) implies that for $n$ large enough there is at most one jump of size bigger
than $u'_n$ in each interval $((i-1)\Delta_n, (i - 1 + k_n)\Delta_n]$ within $[0,t]$, and no two consecutive
intervals of this form contain such jumps. Despite this, the statement above is of course
not true, the main reason being that $\hat c_i^n$ and the corresponding local average of $c_t$ do not exactly agree. However it is "true
enough" to allow for the next estimators to be consistent for $V(F)_t$:

where $\delta_j^n \hat c = \hat c^n_{jk_n+1} - \hat c^n_{(j-2)k_n+1}$.

Since this is a sum of approximately $[t/k_n\Delta_n]$ terms, the rate of convergence of $V(F)^n_t$
toward $V(F)_t$ is low, probably $1/\Delta_n^{1/4}$ only. However, here we are looking for consistent
estimators, and the rate is not of concern to us. Note that, again, the upper limit in the
sum above is chosen in such a way that $V(F)^n_t$ is computable on the basis of the observations
within the interval $[0,t]$.

Theorem 3.5 Assume all hypotheses of Theorem 3.1, and let $F$ be a continuous function
on $\mathcal M_d^+ \times \mathcal M_d$ satisfying, with the same $p \ge 3$ as in (3.6),

$$|F(x,y)| \le K(1 + \|x\| + \|y\|)^{p-2}\, \|y\|^2. \tag{3.22}$$

Then for all $t > 0$ we have

$$V(F)^n_t \ \stackrel{\mathbb P}{\longrightarrow}\ V(F)_t. \tag{3.23}$$
3.3 An Unbiased Central Limit Theorem. 

At this stage, we can set, with the notation (3.12), (3.13), (3.14) and (3.21), and also (3.11)
and (3.17) for $G$ and $G'$:

$$\hat V(g)^n_t = V'(g)^n_t - \sqrt{\Delta_n}\, \Big( \frac32 A^{n,c,1}_t + A^{n,c,2}_t + k_n\sqrt{\Delta_n}\; V(G - G')^n_t \Big). \tag{3.24}$$

We then have the following, which is a straightforward consequence of the three previous
theorems and of $k_n\sqrt{\Delta_n} \to \theta$, plus the fact that the function $G - G'$ satisfies (3.22) when $g$
satisfies (3.6):

Theorem 3.6 Under the assumptions of Theorem 3.1, and with $Z$ as in this theorem, for
all $t > 0$ we have the finite-dimensional stable convergence in law

$$\frac{1}{\sqrt{\Delta_n}} \big( \hat V(g)^n_t - V(g)_t \big) \ \stackrel{\mathcal L_f\text{-}s}{\longrightarrow}\ Z_t. \tag{3.25}$$



Note that $\theta$ no longer explicitly appears in this statement, so one can replace (3.1) by
the weaker statement

$$k_n \asymp \frac{1}{\sqrt{\Delta_n}} \tag{3.26}$$

(this is easily seen by taking subsequences $n_l$ such that $k_{n_l}\sqrt{\Delta_{n_l}}$ converges to an arbitrary
limit in $(0,\infty)$).

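It is simple to make this CLT "feasible", that is usable in practice for determining a
confidence interval for $V(g)_t$ at any time $t > 0$. Indeed, we can define the following function
on $\mathcal M_d^+$:

$$\bar h(x) = \sum_{j,k,l,m=1}^d \partial_{jk} g(x)\, \partial_{lm} g(x)\, \big( x^{jl} x^{km} + x^{jm} x^{kl} \big). \tag{3.27}$$

We then have $V(\bar h)^n \stackrel{\text{u.c.p.}}{\Longrightarrow} V(\bar h)$, whereas $V(\bar h)_t$ is the right hand side of (3.9). Then we
readily deduce:

Corollary 3.7 Under the assumptions of the previous theorem, for any $t > 0$ we have the
following stable convergence in law, where $Y$ is an $\mathcal N(0,1)$ variable:

$$\frac{\hat V(g)^n_t - V(g)_t}{\sqrt{\Delta_n\, V(\bar h)^n_t}} \ \stackrel{\mathcal L\text{-}s}{\longrightarrow}\ Y, \quad \text{in restriction to the set } \{V(\bar h)_t > 0\}. \tag{3.28}$$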
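
As an illustration of how this corollary translates into a confidence interval, here is a minimal sketch for $d = 1$, where the function in (3.27) reduces to $2x^2 g'(x)^2$. All names are hypothetical: `v_hat` is assumed to be an already computed value of the de-biased estimator, `spot` the list of local estimators $\hat c_i^n$, and `dg` the derivative $g'$.

```python
import math
from statistics import NormalDist


def confidence_interval(v_hat, spot, delta_n, dg, level=0.95):
    """Two-sided asymptotic confidence interval for V(g)_t when d = 1.

    The asymptotic variance is estimated by the Riemann sum of
    2 * c^2 * g'(c)^2 over the local estimators c, scaled by delta_n.
    """
    v_hbar = delta_n * sum(2.0 * c * c * dg(c) ** 2 for c in spot)
    if v_hbar <= 0.0:
        raise ValueError("estimated asymptotic variance must be positive")
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * math.sqrt(delta_n * v_hbar)
    return v_hat - half_width, v_hat + half_width


if __name__ == "__main__":
    # g(x) = x^2, hence g'(x) = 2x.
    print(confidence_interval(1.0, [1.0] * 100, 0.01, lambda x: 2.0 * x))
```

The interval is only meaningful on the event where the estimated variance is positive, which mirrors the restriction to $\{V(\bar h)_t > 0\}$ in the statement.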



Finally, coming back to optimality, the following gives two very small steps toward
optimality. For simplicity we restrict our attention to the one-dimensional case $d = 1$.

Theorem 3.8 Let $d = 1$ and $g$ be a $C^3$ function on $\mathbb R_+$ satisfying (3.6) and which is strictly
increasing, or strictly decreasing.

a) For the parametric model $X_t = \sigma W_t$, where $c_t = \sigma^2 = c$ is a constant (the toy
example of the Introduction), for any $t > 0$ the estimators $\hat V(g)^n_t$ are asymptotically efficient (in
Le Cam's sense) for estimating the number $t\,g(c)$.

b) Let $(T^n)_{n\ge1}$ be another sequence of estimators, with $T^n$ being a function of $(X_{i\Delta_n} :
0 \le i \le [t/\Delta_n])$, and such that for any $X$ satisfying (A-0) and continuous, the variables
$\frac{1}{\sqrt{\Delta_n}}(T^n - V(g)_t)$ converge stably in law to a limit $Z'_t$, defined on an extension of the space,
and with a conditional variance of the form

$$E\big((Z'_t)^2 \mid \mathcal F\big) = \int_0^t H(c_s)\, ds \tag{3.29}$$

for some nonnegative Borel function $H$. Then necessarily $H \ge \bar h$, as given by (3.27), and
in particular

$$E\big((Z'_t)^2 \mid \mathcal F\big) \ge E\big((Z_t)^2 \mid \mathcal F\big). \tag{3.30}$$
In particular, the estimators $U^n(f)_t$ given in (1.1), associated with $g(x) = E(f(\sqrt{x}\,U))$,
always have an asymptotic variance bigger than or equal to the variance (3.9).

There is another way in which the estimators $\hat V(g)^n_t$ are optimal, in the sense that the
Hajek convolution theorem holds under appropriate assumptions. More specifically, let $\mathcal S$
be some subclass of the class of all semimartingales $X$ satisfying (A-r). Let $t > 0$. The
estimators $\hat V(g)^n_t$ are asymptotically efficient in the convolution theorem sense and within
$\mathcal S$ if, for any sequence $(T^n)$ of estimators which are functions of $(X_{i\Delta_n} : 0 \le i \le [t/\Delta_n])$ (as
above) and such that $\frac{1}{\sqrt{\Delta_n}}(T^n - V(g)_t)$ converges in law, whatever $X$ in the class $\mathcal S$, then
for each $X$ in $\mathcal S$ the limiting variable $Z'_t$ of $\frac{1}{\sqrt{\Delta_n}}(T^n - V(g)_t)$ can be realized as $Z_t + Z''_t$, where
$Z_t$ is the limiting process in (3.25) and the variable $Z''_t$ is independent of $Z_t$ conditionally
on $\mathcal F$.

Then (a) above yields efficiency of $\hat V(g)^n_t$ in this sense for the class $\mathcal S$ of all $X_t = \sigma W_t$
for $\sigma > 0$ arbitrary. But efficiency also holds in a much wider class, as has been shown in [2]:
namely, for the class $\mathcal S$ of all one-dimensional continuous $X$ such that the volatility $\sigma$ has the
form $\sigma_t = f(t, X_t, Y_t)$, where $Y_t$ is another Itô semimartingale independent of the Brownian
motion driving $X$, plus some smoothness on the coefficients of the equations.

So far, however, a "general" theory of optimality in our non-parametric context seems 
still out of reach. 



Remark 3.9 For a given function $g$, the variables $V(g)^n_t$ and $V'(g)^n_t$ are easy to compute
on the basis of the observed increments $\Delta_i^n X$, and the same is true of the de-biasing terms
$A^{n,c,1}_t$ and $A^{n,c,2}_t$. The de-biasing term $V(G - G')^n_t$, which occurs when $c$ has jumps, is also
straightforward to compute, as long as one has an explicit expression for $G - G'$. In some
instances $G - G'$ has a closed form expression, for example when $g$ is a linear combination
of functions of the form $\prod_j x^{m_j q_j}$ for any choice of the indices $m_j$ and $q_j$ between $1$ and
$d$ (repeated indices are of course possible). Otherwise, one may always rely upon numerical
integration to compute $(G - G')(x,y)$ for the observed values of $x$ and of $y = \delta_j^n \hat c$.
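
The numerical integration mentioned in this remark can be sketched as follows for the kernel $G$ of (3.11) in dimension $d = 1$. This is an assumption-laden illustration, not the authors' procedure: the function name and the use of the composite Simpson rule are choices made here, and the same pattern would apply to $G'$ given its integrand.

```python
def jump_bias_kernel(g, x, y, panels=200):
    """Composite Simpson approximation of
    G(x, y) = integral over [0, 1] of g(x + w*y) - (1 - w)*g(x) - w*g(x + y)
    for a real-valued function g of one real variable.
    """
    if panels <= 0 or panels % 2:
        raise ValueError("panels must be a positive even integer")

    def integrand(w):
        return g(x + w * y) - (1.0 - w) * g(x) - w * g(x + y)

    h = 1.0 / panels
    total = integrand(0.0) + integrand(1.0)
    for i in range(1, panels):
        # Simpson weights: 4 at odd nodes, 2 at even interior nodes.
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return total * h / 3.0


if __name__ == "__main__":
    # For g(x) = x^2 a direct computation gives G(x, y) = -y^2 / 6.
    print(jump_bias_kernel(lambda u: u * u, 1.5, 0.6))
```

For $g(x) = x^2$ the integrand equals $(w^2 - w)\,y^2$, so $G(x,y) = -y^2/6$ whatever $x$; Simpson's rule is exact for such polynomial integrands, which makes this case a convenient sanity check.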



Example 3.10 (Quarticity) Suppose $d = 1$ and take $g(x) = x^2$, so we want to estimate
the quarticity $\int_0^t c_s^2\, ds$. In this case we have



,2 

h{x) = 2x\ G{x, y) = 0, G'(x, y) = h{x) = 8x^. 

6 

Then the "optimal" estimator for the quarticity is 

+ (^^ ((^)2 + (^-/^^j„,^^^)2) 

—l^j=^ \^3^> l{||57_ic1|Vj|5]V,J^|V«pycll}, 

the last term being unnecessary when one knows that $c_t$ is continuous.

The asymptotic variance is $8 \int_0^t c_s^4\, ds$, to be compared with the asymptotic variance of
the more usual estimators $\frac{1}{3\Delta_n} \sum_{i=1}^{[t/\Delta_n]} (\Delta_i^n X)^4$, which is $\frac{32}{3} \int_0^t c_s^4\, ds$.
4 Proofs

4.1 A Localization Procedure.

We start by showing that one can replace (A-r) by the following stronger assumption:

Assumption (SA-r): We have (A-r) and moreover:

(i) $\tau_1 = \infty$ identically, the five processes $c, b, \tilde b, \tilde c, \tilde c'$ and the two functions $\delta, \tilde\delta$ are uniformly
bounded;

(ii) for any $a > 0$ there is a non-random integer $N_a$ such that all six càdlàg processes $b_t$, $\tilde b_t$,
$c_t$, $\tilde c_t$, $\tilde c'_t$, and $\gamma_t = \int_E \|\tilde\delta(t+,z)\|^2\, \lambda(dz)$ have at most $N_a$ jumps with size bigger than $a$ (since
$\|\tilde\delta(\omega,t,z)\|^2 \wedge 1 \le J(z)$ and $\tilde\delta$ is bounded by (i), the process $\gamma_t$ is finite-valued and càdlàg). □

The following localization lemma looks familiar for the sort of problems at hand. However
there are two special requirements here: the condition (ii) above, and the connection between
$X$ and $c$, which we have to maintain when we localize. For these reasons we provide a
(sketchy) proof.

Lemma 4.1 It is sufficient to prove the four theorems 3.1, 3.4, 3.5 and 3.6 under the
stronger assumption (SA-r).

Proof. Denote by (SA'-r) the assumption according to which (A-r) and (i) of (SA-r) hold.
By the usual localization procedure and a two-step argument, it is enough to show that
if $X$ satisfies (A-r) (resp. (SA'-r)), there is a (not necessarily increasing) sequence $S_n$ of
stopping times such that $S_n \stackrel{\mathbb P}{\longrightarrow} \infty$ and a sequence $X^n$ of processes satisfying (SA'-r) (resp.
(SA-r)), such that $X^n_t = X_t$ for all $t < S_n$.

Step 1) We start with X satisfying (A-r). Set ipnix) = 1 A (2 - ^)''' for 2; > 0, and 
fn{x) = xi;n{\\x\\) for X G M'^ and 7„(y) = y^pn{\\y\\) if y G and 

Sn = Tn/\inf [t : \\Xt\\ + ||q|| + \\bt\\ + \\bt\\ + \\ct\\ > n). 
First, we define a new process c" as 

Ct = ctAS„- Yl (Ac, -7„(Ac,)). 

s<tAS„ 

It satisfies ()2.2p with the new coefficients 

b't =bt l{t<s„}, = l{t<s„}, S"'it,z) = fn(S{t,z)) l{t<s„}. 

By construction the processes 6" and c^ = ^"5^ " are cadlag with norms bounded by n, 
and S'^{t,z) is caglad in t and satisfies < 2n and ||(5"(t,z)|p A 1 < Jniz). Moreover 
||Act|| < 2n for t < Sn and fn{y) = y when ||y|| < 2n, implying that c" = ct for t < Sn- 
When $t \ge S_n$ we have $c^n_t = c_{S_n-} + \Delta c_{S_n}\, \psi_n(\|\Delta c_{S_n}\|)$; since both $c_{S_n-}$ and $c_{S_n-} + \Delta c_{S_n}$
are $\mathcal M_d^+$-valued and $0 \le \psi_n \le 1$, one easily deduces that $c^n_t = c^n_{S_n}$ is also $\mathcal M_d^+$-valued.
Hence, taking any $\mathcal F_{S_n}$-measurable version of the square-root $\sigma^n_{S_n}$ of $c^n_{S_n}$, one can define the
following process $\sigma^n$, which satisfies $\sigma^n_t \sigma^{n*}_t = c^n_t$ for all $t$:

We can now define X" by (12. ip . with the coefficients 

&r = bt l{t<s„}, cr" as above, z) = /„((5(t, z)) l{t<s„} 

(note that X" is not freezed at time Sn, since its continuous martingale part continues to 
evolve as a Brownian motion after this time). Exactly as above, the process b^" is cadlag 
bounded by n, and 5^{t,z) is caglad bounded by 2n and A 1 < Jn{z), whereas 

c" is cadlag and bounded by 3n by construction. Finally, the density c?" = a"'a^'* of the 
quadratic covariation of the continuous martingale parts of X" and c" is bounded by n 
(because ||c'"'|| < vl|c^|P^ by Kunita-Watanabe inequality) and cadlag, since it equals 
Ctl{t<5„}- 

All this implies that the process $X^n$ satisfies (SA'-r) (with $J_1$ for $X^n$ being the function
$J_n$ associated with $X$). Moreover we have seen that $c^n_t = c_t$ when $t < S_n$ and, again as
previously, $\|\Delta X_t\| \le 2n$ and thus $f_n(\Delta X_t) = \Delta X_t$ if $t < S_n$, implying $X^n_t = X_t$ if $t < S_n$.
Since further $S_n \uparrow \infty$ we have proved that we can replace (A-r) by (SA'-r).

Step 2) Now we suppose that $X$ satisfies (SA'-r), so the six processes $b_t, \tilde b_t, c_t, \tilde c_t, \tilde c'_t, \gamma_t$ are
all càdlàg. For each $a > 0$ we let $T_1^a, T_2^a, \dots$ be the successive times at which at least
one of these six processes has a jump with size bigger than $a$. Note that $T_j^a \le T_j^{a'}$ if
$a' > a$. For all $n, j \ge 1$ there is an integer $l(n,j)$ such that $\mathbb P(T^{1/j}_{l(n,j)} \le n) \le 2^{-j}/n$. Then
$S_n = \inf(T^{1/j}_{l(n,j)} : j = 1, 2, \dots)$ satisfies $\mathbb P(S_n \le n) \le \frac1n$ and thus $S_n \stackrel{\mathbb P}{\longrightarrow} \infty$.

At this point, for n > 1 we first define c" = ct/\s„, which satisfies ()2.2p with 

bt =btl{t<Sn}^ =^tl{t<Sn}^ ^"(i,^) =^(^,^)l{^<5„}• 

We also set cr" = at/\s„, hence crfcr"* = c" identically. Next, we define by (|2.ip with the 
coefficients 6" = bt Ijj^^^^j and a" as above and J" = 6. It is easy to check that each X" 
satisfies (SA-r), with Na = l + l{n,j) if i < a < -pj, and that Xj" = X^ if t < Sn, so the 
proof is complete. □ 



In view of this lemma, we will assume below that X satisfies (SA-r). In the sequel, all 
constants are denoted by K, and they vary from to line. They may implicitly depend on 
the process X. When they depend on an additional parameter p, we write Kp. 

In particular, we have for some A-integrable nonnegative function J on E: 

\\b\\, \\b\\, \\c\\, \\c\\, W^W, J < K, \\6iu;,t,zW <Jiz), ||5(a;, t, z)f < J(z), (4.1) 

For convenience, we will slightly modify our notation for the drift processes. Indeed, since 
5 and 5 are bounded, we rewrite (j2.ip and (j2.2p as follows: 

Xj = Xo + /q bs ds + /g as dWs + /g J^^is, z) (p - v) {ds, dz) 
Q = Co + /d 6s ds + Jl as dWs + /g ?(s, z) {pL - v){ds, dz). 
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This amounts to replacing b in ()2.ip by the bounded cadlag process bt + K'{6{t+, z)) X{dz), 
and likewise for b, and these new versions of b and b still satisfy ()4.ip . As a matter of fact, 
below we will most often use the "continuous part" of X, which is 



Xl = Xo+ I bsds+ I asdWs. (4.3) 
Jo Jo 

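We end this subsection with well known estimates for $X'$ and $c$, under (4.1) and for
$s,t \ge 0$ and $q > 0$:

$$\begin{aligned}
&\mathbb E\Big(\sup_{w\in[0,s]}\|X'_{t+w}-X'_t\|^q \,\Big|\, \mathcal F_t\Big) \le K_q\,s^{q/2}, &&\|\mathbb E(X'_{t+s}-X'_t \mid \mathcal F_t)\| \le Ks\\
&\mathbb E\Big(\sup_{w\in[0,s]}\|c_{t+w}-c_t\|^q \,\Big|\, \mathcal F_t\Big) \le K_q\,s^{1\wedge(q/2)}, &&\|\mathbb E(c_{t+s}-c_t \mid \mathcal F_t)\| \le Ks.
\end{aligned} \tag{4.4}$$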
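
As an informal illustration of the scaling in (4.4), and not as part of the proof, the following sketch estimates the $q$-th moment of the running supremum of a continuous path by Monte Carlo. It assumes the simplest possible case (no drift, constant volatility, so the path is a scaled Brownian motion on a discrete grid); the function name and all numerical values are hypothetical choices made for this example.

```python
import math
import random

def mean_sup_power(s, q, n_steps=50, n_paths=2000, sigma=1.0, seed=0):
    """Monte Carlo estimate of E[sup_{w<=s} |X'_w - X'_0|^q] for X' = sigma * W
    (an assumed toy model: zero drift, constant volatility, discrete grid)."""
    rng = random.Random(seed)
    step_sd = sigma * math.sqrt(s / n_steps)
    total = 0.0
    for _ in range(n_paths):
        x, running_max = 0.0, 0.0
        for _ in range(n_steps):
            x += rng.gauss(0.0, step_sd)
            running_max = max(running_max, abs(x))
        total += running_max ** q
    return total / n_paths

q = 2.0
small = mean_sup_power(0.01, q, seed=1)
large = mean_sup_power(0.04, q, seed=2)
# Multiplying s by 4 should multiply the moment by about 4 ** (q / 2),
# up to Monte Carlo error, in line with a bound of order s ** (q / 2).
ratio = large / small
print(ratio)
```

For a process with jumps, such as $c$, the corresponding exponent in (4.4) is capped at 1, which this continuous toy model does not show.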

4.2 An Auxiliary Result on Ito Semimartingales 

In this subsection we give some simple estimates for a $d$-dimensional semimartingale

$$Y_t = \int_0^t b^Y_s\,ds + \int_0^t \sigma^Y_s\,dW_s + \int_0^t\!\!\int_E \delta^Y(s,z)\,(\mu-\nu)(ds,dz)$$

on some space $(\Omega,\mathcal F,(\mathcal F_t)_{t\ge0},\mathbb P)$ which may be different from the one on which $X$ is defined,
as well as $W$ and $\mu$, but we still suppose that the intensity measure $\nu$ is the same. Note
that $Y_0 = 0$ here. We assume that for some constant $A$ and function $J^Y$ we have, with
$c^Y = \sigma^Y\sigma^{Y*}$:

$$\|b^Y\| \le A,\quad \|c^Y\| \le A^2,\quad \|\delta^Y(\omega,t,z)\|^2 \le J^Y(z) \le A^2,\quad \int_E J^Y(z)\,\lambda(dz) \le A^2. \tag{4.5}$$

The compensator of the quadratic variation of Y is of the form JqcJ ds, where cj = cY + 
Jj^ S^{t, z) 5{t, z)* \{dz). Moreover, if the process is itself an Ito semimartingale, the 
quadratic covariation of the continuous martingale parts of Y and is also of the form 
jl^c'Jds for some process c?^, necessarily bounded if both Y and satisfy (j4.5p (and, if 
Y = X, we have = c and c'^ = c'). 

Finally we associate with any process Z the variables 



$$\eta(Z)_t = \sqrt{\mathbb E\Big(\sup_{s\le t}\|Z_s - Z_0\|^2 \,\Big|\, \mathcal F_0\Big)}. \tag{4.6}$$



Lemma 4.2 Below we assume (4.5), and the constant K only depends on A.
a) We have for $t \in [0,1]$:

\\nyt\^o)-tbl\\<t^{b'')t<Kt 

|E(y/ Yr I -Fo) - tc^'^'"! < Kt{t + Vtrfib^^ + r?(c^)0 < Kt, ^ 
and if further ||E(c^ — | J^o)\\ < A^t for all t, we also have 

|E(y/ Yt"" I To) - tc^'^™| < 2*3/2(2^2^ + Ari{b^)t) < Kt^''^. (4.8) 
b) When Y is continuous, and i/E(||c^ — c^P | J-^o) ^ ^"^^ for all t, we have

|E(y/ Yl^ Yl I Jo) - t'(c^'^^^''" + c^c^^''"' + c^'^'^c^-'^') I < Kt5/2. (4.9) 

c) When c^ is a (possibly discontinuous) semimartingale satisfying the same conditions 
|^.5[ j as Y , and if Y itself is continuous, we have 

\mY^ Y^ - tc^'''){cr"' - c^''^) I J-o) < Kt'l\Vt + r,{7!'')t). (4.10) 

Proof. a) The first part of (4.7) follows by taking the $\mathcal F_0$-conditional expectation in the
decomposition $Y_t = M_t + t\,b^Y_0 + \int_0^t (b^Y_s - b^Y_0)\,ds$, where $M$ is a $d$-dimensional martingale with
$M_0 = 0$. For the second part, we deduce from Itô's formula that $Y^jY^m$ is the sum of a
martingale vanishing at $0$ and of

bi fY^ds+b^ fYids+ fYr{Vs-^,)ds+ fY^(b^-b^)ds+cl^^t+ [\cj'^"^ -c^'^"^) ds . 
Jo Jo Jo Jo Jo 

Since $\mathbb E(\|Y_t\| \mid \mathcal F_0) \le KA\sqrt t$, as in (4.4), we deduce the second part of (4.7) and also (4.8)
by taking again the conditional expectation and by using the Cauchy-Schwarz inequality
and the first part.

b) For any indices $j_1,\dots,j_4$ Itô's formula yields a martingale $M$ vanishing at $0$ such
that

nf=i^/' = M,+EiJibr'^^u,^^^^^^^,Y^-ds 

+ 2^1<l,l'<d,l^l'%''' Io^l<m<4:,m^l,l'^s"' ds (4-11) 
^2 2^1<l,l'<d,lf^l' Jo^'^'^ '^0 ) lh<m<i,mf^l,l' Ub. 

We have used here the continuity of $Y$, which also implies $\mathbb E(\|Y_t\|^q \mid \mathcal F_0) \le K_qA^qt^{q/2}$ for
all $q > 0$. Again, we take the $\mathcal F_0$-conditional expectation and we deal with the second, the
third, and the last term in the right side above by Fubini's theorem and the Cauchy-Schwarz
inequality. For the fourth term we use (4.7), and a simple calculation yields the result.

c) We have Y^Y^^ - tc^'^^ = Bt + Mt and cf'^™ - c^''"" = B[ + M/, with M and M' 
martingales (M is continuous). The processes B, B', {M,M), {M',M') and {M,M') are 
absolutely continuous, with densities bs, b'g, hg, h'^ and h'^ satisfying, by (|4.6|) for Y and c^: 

<2||y,|| 116^11 + llcf-c^ll, \b',\<K, \hs\<K\\Ysf, \K\<K, 

whereas /i'/ = y/^'^'.fc,'™ + y^fcc'^j;«'"_ ^s seen before, EdlytH"? | Fq) < Kqtil'^ for all q, and 
E(||ci^ - clf I Jo) < Kt. This yields ¥.{B'^ \ Fq) < Kt^ and ¥.{Mf \ Fq) < Kt^ . Since 
\B[\ < Kt and E(Mj'^ | Fq) < Kt, we deduce that the Fq- conditional expectations of BtB'^ 
and BtM[ and MtB[ are smaller than Kt^. 

Finally ¥.{MtM[ \ Fq) = E((M, M')t | Fq), and (M, M')t is the sum of Cg^''^''™ J^J y/ ds + 
Jp y/ (c^^''^'^™' — c^'^'^"^) ds and a similar term with k and j exchanged. Then using again 
E(||yj||2 I Fo) < Kt, plus ||E(yt I Fo)\\ < Kt and Cauchy-Schwarz inequality, we obtain that 
the above conditional expectation is smaller than K{t'^ + t^^'^r]{^^)t). This completes the 
proof of (4.10). □
4.3 Some Estimates.

1) It is much easier (although unfeasible in practice) to replace $\hat c^n_i$ in (3.3) by the estimators
based on the process $X'$ given by (4.3). Namely, we will replace $\hat c^n_i$ by the following:

$$\hat c'^n_i = \frac{1}{k_n\Delta_n}\sum_{j=0}^{k_n-1}\Delta^n_{i+j}X'\,\Delta^n_{i+j}X'^*.$$

The comparison between $\hat c^n_i$ and $\hat c'^n_i$ is based on the following consequence of Lemma
13.2.6 of [6], applied with $F(x) = xx^*$, so $k = 1$ and $p' = s' = 2$ and $s = 1$ and $\varepsilon = 0$
(because $r < 1$) with the notation of that lemma. Namely, we have for all $q \ge 1$ and for
some sequence $a_n$ going to 0:

$$\mathbb E\Big(\big\|\Delta^n_iX\,\Delta^n_iX^*\,1_{\{\|\Delta^n_iX\|\le u_n\}} - \Delta^n_iX'\,\Delta^n_iX'^*\,1_{\{\|\Delta^n_iX'\|\le u_n\}}\big\|^q\Big) \le K_q\,a_n\,\Delta_n^{(2q-r)\varpi+1}.$$

Since $\mathbb E(\|\Delta^n_iX'\|^q) \le K_q\Delta_n^{q/2}$ for any $q > 0$ by classical estimates, implying by Markov's
inequality that $\mathbb E(\|\Delta^n_iX'\|^{2q}\,1_{\{\|\Delta^n_iX'\|>u_n\}}) \le K\Delta_n^{q+q'(1/2-\varpi)}$ for any $q' > 0$, by taking
$q'$ large enough we then easily deduce

$$\mathbb E(\|\hat c^n_i - \hat c'^n_i\|^q) \le K_q\,a_n\,\Delta_n^{(2q-r)\varpi+1-q}. \tag{4.12}$$
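
To make the role of the truncation concrete, here is a minimal one-dimensional sketch of a truncated local volatility estimator of the kind compared above. The precise form of the estimator (3.3) is not reproduced in this section, so the formula in the docstring, the truncation level `u`, the window length `k` and every numerical value below are assumptions made only for illustration.

```python
import math
import random

def local_spot_variance(increments, delta, k, u):
    """Assumed form: (1 / (k * delta)) * sum_{j<k} dX_{i+j}^2 * 1{|dX_{i+j}| <= u},
    computed for every admissible starting index i with a sliding window."""
    kept = [dx * dx if abs(dx) <= u else 0.0 for dx in increments]
    window = sum(kept[:k])
    estimates = [window / (k * delta)]
    for i in range(1, len(kept) - k + 1):
        window += kept[i + k - 1] - kept[i - 1]
        estimates.append(window / (k * delta))
    return estimates

rng = random.Random(0)
n = 20000
delta = 1.0 / n
sigma = 0.5                                  # constant volatility, so c = 0.25
k = int(round(1.0 / math.sqrt(delta)))       # window length of order delta ** -0.5
u = 3.0 * delta ** 0.49                      # hypothetical truncation level
increments = [rng.gauss(0.0, sigma * math.sqrt(delta)) for _ in range(n)]
for position in (1000, 7000, 15000):         # a few large jumps to be discarded
    increments[position] += 0.2
estimates = local_spot_variance(increments, delta, k, u)
average = sum(estimates) / len(estimates)
print(average)                               # should be close to sigma ** 2
```

In this toy setting the three jumps exceed `u` and are dropped, so the local estimates behave as if they were computed from the continuous part alone, which is the situation the bound (4.12) quantifies.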



2) The jumps of $c$ also potentially cause troubles. So we will eliminate the "big" jumps
as follows. For any $\rho > 0$ we consider the subset $E_\rho = \{z : J(z) > \rho\}$, which satisfies
$\lambda(E_\rho) < \infty$, and we denote by $\mathcal G^\rho$ the $\sigma$-field generated by the variables $\mu([0,t] \times A)$, where
$t \ge 0$ and $A$ runs through all Borel subsets of $E_\rho$. We let $S^\rho_1, S^\rho_2, \dots$ be the successive jump
times of the Poisson process $\mu((0,t] \times E_\rho)$, and $T(n,i,\rho)$ the smallest $S^\rho_j$ which exceeds
$(i-1)\Delta_n$, and $\Omega_{n,t,\rho}$ the set on which $S^\rho_{j+1} > t \wedge S^\rho_j + (6k_n+1)\Delta_n$ for all $j \ge 0$ (with the
convention $S^\rho_0 = 0$; taking $6k_n$ here instead of the more natural $k_n$ will be needed in the
proof of Theorem 3.5 and makes no difference here). All these objects are $\mathcal G^\rho$-measurable,
and $\mathbb P(\Omega_{n,t,\rho}) \to 1$ as $n \to \infty$, for all $t, \rho > 0$.

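We define the processes

$$\begin{aligned}
&\tilde b(\rho)_t = \tilde b_t - \int_{E_\rho}\tilde\delta(t+,z)\,\lambda(dz), \qquad \tilde c(\rho)_t = \tilde\sigma_t\tilde\sigma_t^* + \int_{(E_\rho)^c}\tilde\delta(t+,z)\,\tilde\delta(t+,z)^*\,\lambda(dz)\\
&c(\rho)_t = c_t - \int_0^t\!\!\int_{E_\rho}\tilde\delta(s,z)\,\mu(ds,dz) = c^{(1)}(\rho)_t + c^{(2)}(\rho)_t, \quad\text{where}\\
&c^{(1)}(\rho)_t = c_0 + \int_0^t \tilde b(\rho)_s\,ds + \int_0^t \tilde\sigma_s\,dW_s\\
&c^{(2)}(\rho)_t = \int_0^t\!\!\int_{(E_\rho)^c}\tilde\delta(s,z)\,(\mu-\nu)(ds,dz),
\end{aligned} \tag{4.13}$$

so $\tilde c(\rho)$, which is $\mathbb R^{d^2}\otimes\mathbb R^{d^2}$-valued, is the cadlag version of the density of the predictable
quadratic variation of $c(\rho)$. Moreover $\mathcal G^\rho = \{\emptyset,\Omega\}$ and $(\tilde b(\rho),\tilde c(\rho)) = (\tilde b,\tilde c)$ when $\rho$ exceeds
the bound of the function $J$.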
the bound of the function J. 
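By Lemma 2.1.5 and Proposition 2.1.10 in [6] applied to each component of $X'$ and
$c^{(2)}(\rho)$, plus the property $\|\tilde b(\rho)\| \le K/\rho$, for all $t \ge 0$, $s \in [0,1]$, $\rho \in (0,1]$, $q \ge 2$, we have

$$\begin{aligned}
&\mathbb E\Big(\sup_{w\in[0,s]}\|X'_{t+w}-X'_t\|^q \,\Big|\, \mathcal F_t\vee\mathcal G^\rho\Big) \le K_q\,s^{q/2}\\
&\|\mathbb E(X'_{t+s}-X'_t \mid \mathcal F_t\vee\mathcal G^\rho)\| + \|\mathbb E(c(\rho)_{t+s}-c(\rho)_t \mid \mathcal F_t\vee\mathcal G^\rho)\| \le Ks\\
&\mathbb E\Big(\sup_{w\in[0,s]}\|c^{(2)}(\rho)_{t+w}-c^{(2)}(\rho)_t\|^q \,\Big|\, \mathcal F_t\vee\mathcal G^\rho\Big) \le K_q\,\phi_\rho\,(s+s^{q/2})\\
&\mathbb E\Big(\sup_{w\in[0,s]}\|c(\rho)_{t+w}-c(\rho)_t\|^q \,\Big|\, \mathcal F_t\vee\mathcal G^\rho\Big) \le K_q\Big(\phi_\rho\,s+s^{q/2}+\frac{s^q}{\rho^q}\Big) \le K_{q,\rho}\,s,
\end{aligned} \tag{4.14}$$

where $\phi_\rho = \int_{(E_\rho)^c} J(z)\,\lambda(dz) \to 0$ as $\rho \to 0$. Note also that $\|\tilde b(\rho)_t\| \le K/\rho$.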
3) For convenience, we put 

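$$\tilde b(\rho)^n_i = \tilde b(\rho)_{(i-1)\Delta_n}, \qquad c(\rho)^n_i = c(\rho)_{(i-1)\Delta_n}, \qquad \tilde c(\rho)^n_i = \tilde c(\rho)_{(i-1)\Delta_n}. \tag{4.15}$$

All the above variables are $\mathcal F^{n,\rho}_i$-measurable. We also set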

r?(p)t,s = sup (||yt+^ - Ytf : u e [0, s], Y = b,b{p),c,c{p),^) 



V{p)lj = ^E(r?(p)(,_i)A„jA„ I -^fO, r/(p)f = 7j{p) 



n 



Lemma 4.3 For all t,p > we have A„E(^|^^"' vip)?) ~^ 0? ^'^^ /o'" i) ^ suc/i i/iai 
j + k< 2K we have E(?7(p)[Yi,fc ' ^^'^^ - 

Proof. The second claim follows from the definitions of r][p)^ and r][p)^ ■ and Cauchy- 
Schwarz inequality. It is enough to prove the first claim when i]{p)t,s = sup„g[o^s] | |yt+u— ^tl P 
with y = 6 or y = h{p) or y = c{p) or y = c or y = c'. Recalling 7 in (SA-r), we have 
||A'J(p)^|| < ||A6s|| + KAjs/p (by Markov's inequality) and ||Ac(/))s|| < ||A(CT2j^|| + Ajs- 
Hence by (SA-r), and in all cases, the successive jump times of Y, with jump size || AY^ || > a, 
are Tf,T^,--- ,T^^ for some finite integer iV^. We set Y/" = Yt - Es<t^^sl{\\AY4>a}, 
with (random) modulus of continuity v{a,s) = sup„g[o^s]^t>o ll^t+u — ^t"ll- The variables 
v{p)iA„,k„A„ are smaller than some K, and also than t;(a, 2/c„A„)^ if the interval I{n,i) = 
{{i — 1)A„, {i + 2kn — 1)A„] contains no T?, hence 

Na 



Vip)7 < ^E(.;(a,2fe„A„)2| J-f'O+i^ j;P(r,,, G/(n,i)| J-f'O. 

i=i 

It follows that 

/ [t/An] 

a^=E(A„ Z V{p)l 
^ 1=1 

<E(A„ E y^E{v{a,kr,Any I :^tn)+KAn E H E P(T"G/(n,i) I J-f'''; 
^ i=l ^ j=l ^ i=l 



< t ^JE{v{a, 2/c„A„)2) + KAn Yl E E l{(i+m-l)A„<T"<(i+m)A„} 

j=l m=0 ^ i=l 

< t ^E{v{a, 2knAnY) + KknAnNa. 
Since $v(a, 2k_n\Delta_n)$ is bounded and $\limsup_n v(a, 2k_n\Delta_n) \le a$, we deduce that $\limsup_n a^n_t \le ta$
and, since this holds for all $a > 0$, the claim follows. □

4) Let us introduce some additional notation. Firstly, we define the $\mathcal G^\rho$-measurable (random)
set of integers:

$$L(n,\rho) = \{i = 1,2,\dots : \mu(((i-1)\Delta_n, (i+2k_n)\Delta_n] \times E_\rho) = 0\} \tag{4.16}$$

(taking above $2k_n$ instead of $k_n$ is necessary for the proof of Theorem 3.4). Observe that

$$i \in L(n,\rho),\ 0 \le j \le 2k_n \ \Longrightarrow\ c^n_{i+j} - c^n_i = c(\rho)^n_{i+j} - c(\rho)^n_i. \tag{4.17}$$

Secondly, we define the following R'^ (g) M'^- valued variables 

af = AfX' AfX'* - cf A„, = A-k S'=o' 

= E -=0' c^+p _ Pf = P?-c^ = i E ■=o'(cF+, - cf) (4.18) 

Now we proceed with estimates. By Itô's formula and the Burkholder-Gundy inequality and
the first part of (4.14), for all $q > 0$ we get



(the third estimate follows from the first one and Hölder's inequality, the last one from the
third one and $\hat c'^n_i = \overline\alpha^n_i + \beta^n_i$ plus $\|\beta^n_i\| \le K$). The third inequality above can be improved:
using the second inequality in (4.19), by successive conditioning and again the Burkholder-Gundy
inequality applied to the martingale increments $\alpha^n_{i+j} - \mathbb E(\alpha^n_{i+j} \mid \mathcal F^{n,\rho}_{i+j})$, we obtain

$$\mathbb E(\|\overline\alpha^n_i\|^q \mid \mathcal F^{n,\rho}_i) \le K_q\,\Delta_n^{q/4}. \tag{4.20}$$

Moreover, since the set $\{i \in L(n,\rho)\}$ is $\mathcal G^\rho$-measurable, the last part of (4.14), (4.17) and
Hölder's inequality readily yield

^9/2 



q>2,ieL{n,p) E(||/3rin J-f < ^.(v^^^P + A^/^ + ^). (4.21) 



5) The previous estimates are not enough for us. We will apply the estimates of Lemma [4.21 
with Yt = +t ~ ^(j-i)A given pair n, i, and with the filtration (-7^(i_i)A„+t V 

G^)t>o- We observe that on the set A{p,n,i) = {3j < 2/c„ : i — j £ L{n,p)}, which is Q^- 
measurable, and because of (I4.17p . the process coincide with c(/3)(j_i)A„+t — c(/9)(i_i)A„ 
if i G [0,A„]. Then in restriction to this set, by (|4.8p and (|4.9p and by the definition of 
v{p)7n have 

|E(AfX'^- A^X'- I J-f '0 - c'l'^^'Anl < KpATiV^n + il{p)li) 

|E(A^X'J' A^X"= A^X" AfX'- I J-f - {cl'^f^^ + d^^d^^ + cl''^cl^^)Al\ < Kp^T 
(the constant above depends on $\rho$, through the bound $K/\rho$ for the drift of $c(\rho)$). Then a
simple calculation gives us



n,jl n,km , n,jm n,kl\ 



5/2 



on A{p,n,i) (4.22) 



Next, we apply Lemma 4.2 to the process $Y_t = c(\rho)_{(i-1)\Delta_n+t} - c(\rho)_{(i-1)\Delta_n}$ for any given
pair $n, i$, and with the filtration $(\mathcal F_{(i-1)\Delta_n+t} \vee \mathcal G^\rho)_{t\ge0}$. We then deduce from (4.7), plus again
(4.17), that



{i-l)A„+t 



jk 

'^(i-l)A. 



i G L{n,p), <t < knAn 



\(Jm 

AC(i_i)A„+t 



C{i-1)A, 



tc{p) 



n,jklm I 



<Kptr,{p)l,^ 



\nCi^-l)A^+t - C(,_l)A„ I - tKp)f II < Kptr,{p)l,^ 



(4.23) 

Moreover, the Cauchy-Schwarz inequality and (j4.19p on the one hand, and (I4.10p applied 
with the process Yt = — on the other hand, give us 

r |E(a:^'^' Afh{pr^ I J-f''') I < KAMI, u 24) 

^^^^nM ^n^ms | _^n,P) | < KpA^iV^ + r/(p)^i). ^ ' ' 



i G L(n, p) 



6) We now proceed to estimates on a" and /?", in the next two lemmas: 



Lemma 4.4 We have 



i G L(n, p) 



E« I J^OII < KpVA;(V5; + r/(p)J 



E/ — n,jk — n,lm, i -j~n,p\ 



n,jl n,km , n,jm n,kl\\ 
• C . ' + C . '-^ C ■ ' ) I 

< Kp^{,/A;; + r]{p)l 



Proof. The first claim above directly follows from the first part of (|4.22|) . For the second 

i i An n,km , n,jm n,kl , n,jk n,lm 

part, we set Q = Cj Cj + Cj Cj and write as 



1 



AJt^ 1 

X ^ njk n,lm 
■u=0 



kyi 2 /^Ti 1 

E\ ^ n,jk nim 
u=0 v=u+l 



-2 



!m , , \ ^ \ ^ nIm n,jk 



«=0 v=u+l 



By (|4.19|) and (|4.22|) and successive conditioning and the second part of Lemma r4.3l the F"'^- 
conditional expectation of the two last terms above is smaller than Kp\/ An (-v/A„ + ri{p)1), 
in restriction to the set {i G L(n,p)}. The conditional expectation of the first term, up to 
Kp An and on the same set again, is p- X]«=o^ ^i^i+u I ■^?'^)- Using the boundedness of ct 
and (liTljl and dHZ]), we easily check that |E(Q„ | J""''') - 4"| < Kpy/A^ on the same set 



when u < kn, and the second claim follows. 



□ 



Lemma 4.5 We have



i G L(n, p) =^ < 



|E(/3f I J-f'O - ^fe(p)F|| < ApVA;(VA; + r?(p)-) 



nlm\ 



knAn -^^YL,jklm\ 



[ |E(a:^'^'^^:^''"^ I J-f'OI < KpVA;(VA; + r/(p)r). 



<Kp^{y/\; + r]{p)f 



, cip)'^'='^^\<Kp^i^ + rj{p)l 
Proof. The first claim follows from the first part of (4.23). For the last three claims, and
for simplicity, we consider the case $(j,k) = (l,m)$ and omit the indices below, but the same
argument works in the general case. We have

u=0 " u=0 v=u+l 

(4.23) and successive conditioning and the second part of Lemma 4.3 yield that the $\mathcal F^{n,\rho}_i$-conditional
expectation of the above is, on the set $\{i \in L(n,\rho)\}$:

" u=0 " u=0 v=u+l 

where < (rj{p)^ + ^ 'g' E(||cr+, - cnv{p)f+u.v-u I -^f'" 

^ " u=0 v=u+l 

Another application of the first part of (4.23), plus again the second part of Lemma 4.3 and
$\|\tilde b(\rho)\| \le K/\rho$ and the Cauchy-Schwarz inequality and the fact that $\sum_{u=0}^{k_n-1} u(2k_n - 2u - 1) =
k_n^3/3 + O(k_n^2)$ as $k_n \to \infty$, and $k_n \asymp 1/\sqrt{\Delta_n}$, imply that the above expression is

Coming back to the general situation in which $(j,k)$ and $(l,m)$ may be different, we deduce
the second claim.
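
The counting identity invoked in the proof can be checked directly. The sketch below assumes that the sum in question reads $\sum_{u=0}^{k-1} u(2k-2u-1)$, which is our reading of a partly illegible display; under that reading the sum equals $k(k-1)(2k-1)/6$ exactly, hence $k^3/3 + O(k^2)$.

```python
def weighted_sum(k):
    """Exact value of sum_{u=0}^{k-1} u * (2k - 2u - 1) (assumed form of the sum)."""
    return sum(u * (2 * k - 2 * u - 1) for u in range(k))

def closed_form(k):
    """Candidate closed form k(k-1)(2k-1)/6, which is k^3/3 + O(k^2)."""
    return k * (k - 1) * (2 * k - 1) // 6

for k in (10, 100, 1000):
    # The last column approaches 1/3 as k grows.
    print(k, weighted_sum(k), closed_form(k), weighted_sum(k) / k ** 3)
```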

Next, we have 

M=0 tt=0 

The third claim is then proved exactly as the previous one. 
Finally, equals 

^n~l 2 fc^i 1 A^TT, 2 kfi 1 



y^2^ ^ 

U,V=Q U = V=U + 1 14 = v=u + l 

By successive conditioning, the J^^'^'-conditional expectation of the first term in the right 

side above is smaller than ETpA^^^ by (j4.19p and (|4.23p . Upon using additionally ()4.24p 
we see that the conditional expectations of the second and third terms are smaller than 
Kp-v/A„(-v/A„ + ri{p)f- Therefore the last claim holds. □ 

Finally, we get the following two estimates on the variables $\gamma^n_i$ of (4.18), for any $q \ge 2$:

El n,jk nim i -r-n,p\ 2 t n,jl n,km , n,jm n,kl\ 

ieL{n,p) ^\ -'-^c{pr'''"'\<K^{^, + ^) (4.25) 

iE(ii7?r I ^r) < K,{^n<t>, + + ^). 
To see that the first claim holds, one expands the product $\gamma^{n,jk}_i\gamma^{n,lm}_i$, and by successive
conditioning, the Cauchy-Schwarz inequality and (4.20), (4.23) and Lemmas 4.4 and 4.5,
the contributing terms are

n,jk n,lm , n,ik n,lm , ~sn,jk~snlm , "FinJk ~^n,lm 

(^i »i + »i+k,, (^i+k^ +Pi l^i + Pi+kr, Pi+kr. 

I / n,ik n,jk\/ n,lm n,lm\ ~nn,jk/ n,lm n,lm\ ~nn,lm/ n,jk n,jk\ 
'^\'^i+k„~^i )\^i+kn~^i >~Pi vCj+fc„ — Cj >—Pi \C-i+k„ ~ 

For the second claim we use (4.20) and (4.21), and it holds for all $q \ge 2$.



4.4 The Behavior of Some Functionals of c(p). 

For $\rho > 0$ we set

U{p)7 = Mp)n' 1{|IMp)?II></4}' where 

The aim of this subsection is to prove the following lemma: 

Lemma 4.6 Under (SA-r) and (3.19) we have

lim limsup E{U{py^) = 0. 



(4.26) 



Assumption (SA-r) is of course not really necessary for this. What is needed are the
assumptions concerning the process $c$ only.

Proof. With the notation ()4.13p . and for / = 1,2 we define /^^'^(p)" and U^^\p)^ as above, 
upon substituting c{p) and </4 with S{p) and </8. Since f/(p)^ < W^'^\p)'^ +W^^\p)'i , 
it suffices to prove the result for each U^^\p)^. 

First, ||/J*-^Hp)j IP l{||^(i)(p)"||>n' /8} smaller than ivr||/x('-)(p)"||^/u'^, whereas (recalling 
\Hp)\\ < K/p) classical estimates yield E(||^(i) (p)^f ) < i^A„(l + A„/p). Thus the expec- 
tation of U^^\p)1 is less than K/S^J'^''^^' {I + A„/p), yielding the result for C/(^)(/>)^ 

Secondly, we have U^'^\p)'^ < \\p^'^\p)'^f and the first part of (ITOD yields 

^{WP^'^XpYif) < K(Pp Since (^p ^ as p ^ 0, the result for U^^\p)f follows. □ 



4.5 A Basic Decomposition. 

We start the proof of Theorem 13.11 by giving a decomposition of ^^'(g)" — V{g). For this, 

we use the following ingredients: 

1 - the easily checked equality (when t > A;,nA„) 

/O* 9iCs) ds = I ^[Vf Jl^^^^^- gic.) ds + ,(C.) ds 

+^ /o'"^" {9i^s){kn - 1 - [±;])+9{cit/A,.]-k,.+s)[±;]) ds; 
2 - the key property c^" = + /3";

3 - a Taylor expansion of 5 (a" + around /3", and a Taylor expansion of g{P^) and of 
5f(cs) for s G ((i - 1)A„, {i + kn - 1)A„] around c". 

Then, a simple calculation shows that (^'(5)" — ^{9)t) = S'i=i ^t"'"') as soon as i > 
kn^n, and where (the sums on components below always extend from 1 to d): 

"■,3 _ rjr— v-^[i/A„]-fe„+l / /on „N „( Qn\ a „f pn\ -Tr-n,lm 



io'"''" {9{cs){kn +5(c[VA„]-fc„+.)[^]) d.. 

The leading terms are $V^{n,2}$, $V^{n,3}$ and $V^{n,5}$, and we start by proving that the other terms
are negligible, in the sense that they satisfy

$$V^{n,j}_t \overset{\mathbb P}{\longrightarrow} 0 \tag{4.27}$$

for all $t$. The rest of this subsection is devoted to proving (4.27) for $j = 1,4,6$ successively.



The case j = 1: ()3.6p implies 

\g{^)-g{7in\ < i^(i+iic?ii+iic:"iiri w^-ci^w < K{i+\mY-' ncr-c^^n+KHcr-c^^r. 

Recalling the last part of ()4.19p . we deduce from (j4.12p . from the fact that 1 — rw — p{l — 
2w) < — (1 — 2w) for all g > 1 small enough, and from Holder's inequality, that 

E(|5(c7) - (7(c',")|) < KanA^n"""^^^''". Therefore 

Ef sup \Vr'^\) < KtanA^^P-'^+^/'^-P 

^ s<t ' 

and (4.27) for $j = 1$ follows.

The case j = 4: We denote p" the ith summand in the definition of Vj"'^. By splitting the 
integral showing in this summand into pieces over the intervals ((? — !+ j)A„, [i + j)A„], 
we see that 

= - ^Qimgic'i) ^ pT+P, where pf = (c, - C(,_i)a„) ds- 

l,m j=0 J{i-1)^^ 

yields 

mpT I -^Dll < KAl E(||pf IP I < KAl 
implying in turn, because |(7(c")| < K and kn\/ An — )• 0: 

|E(pr I -^)| < KAl/\ Edp^f I < KAl 
Since is .F^'^^^-measurable, we thus get
which yields (4.27) for $j = 4$.

The case $j = 6$: In the definition of $V^{n,6}_t$ the second term is smaller than $K\sqrt{\Delta_n}$ and thus
goes to 0. As mentioned after (3.2), we have $\hat c^n_1 \to c_0$ and $\hat c^n_{[t/\Delta_n]-k_n+1} \to c_{t-}$, whereas
$k_n\sqrt{\Delta_n} \to \theta$, thus the first term converges to $\frac\theta2\,(g(c_0) + g(c_{t-}))$, and so does the last term,
because both $\int_0^{k_n\Delta_n}(k_n - 1 - [\frac s{\Delta_n}])\,ds$ and $\int_0^{k_n\Delta_n}[\frac s{\Delta_n}]\,ds$ are equivalent to $k_n^2\Delta_n/2$. Then
we conclude (4.27) for $j = 6$.



4.6 The First Leading Term. 

Our aim here is to prove that

$$V^{n,2} \overset{\mathcal L\text{-}s}{\Longrightarrow} Z \tag{4.28}$$

(functional stable convergence in law), where $Z$ is the process defined in Theorem 3.1.
Since cxf = ^ ^ Ej=o^ "^i+i' ^ change of indices allows us to rewrite F"'^ as 



[t/An] (i-l)A{fc„-l) 

rn,2 ^ \ ^ \ ^ n,/m nJm i n.im -'^ \ ^ n i on \ 

t =-r^ Z^^i ' ^^^^^ "'i =ir dlrr,g{Pi.j). 



i=l ' j=(i-[f/A„]+fc„-l)+ 

Observe that $w^n_i$ and $\alpha^n_i$ are measurable with respect to $\mathcal F^n_i$ and $\mathcal F^n_{i+1}$, respectively, so by
Theorem IX.7.28 of [5] (with $G = 0$ and $Z = 0$ in the notation of that theorem) it suffices to
prove the following four convergences in probability, for all $t > 0$ and all component indices:

[t/A„]-fc„ + l 

t_ wf'^¥.{af'^\Tf) ^ Q (4.29) 



n 



i=l 

[i/An]-fcn + l 



E lE(ar"' «r''"^ I -^D ^ / d,,g{cs) di^aics) (c^'c^ + c^J^c'J) ds 

^ i=l 

(4.30) 

[t/A„]-k„ + l 

-2 E iK"riE(ii«?r 1-^) ^ (4.31) 

" i=i 

[t/A„]-fc„+l 

i E <''"E(a^''"^AriV I JT) A 0, (4.32) 

where $N = W^k$ for some $k$, or is an arbitrary bounded martingale, orthogonal to $W$.

For proving these properties, we pick a $\rho$ bigger than the upper bound of the function
$J$, so $\mathcal G^\rho$ becomes the trivial $\sigma$-field and $\mathcal F^{n,\rho}_i = \mathcal F^n_i$. In such a way, the set $L(n,\rho)$ is the
set of all positive integers, and we can apply all estimates of the previous subsections with
the conditioning $\sigma$-fields $\mathcal F^n_i$. Therefore (4.19) and the property $\|w^n_i\| \le K$ readily imply
(4.29) and (4.31). In view of the form of $\alpha^n_i$, a usual argument (see e.g. [6]) shows that in
fact $\mathbb E(\alpha^{n,lm}_i\,\Delta^n_iN \mid \mathcal F^n_i) = 0$ for all $N$ as above, hence (4.32) holds.

For (OOl) . by (021) it suffices to prove that 

[t/A„]-fc„+l t 

A\ ^ n,jk n,lm i n,jl n,km . n,jm nM\ P, n / \q / \ I il km , im kl\ j 

n 2^ (Ci'^c^' +Ci'' )^ djkg{cs)dimg{cs){cic, +ci c,)ds. 

i=l -^0 

In view of the definition of $w^n_i$, for each $t$ we have $w^{n,jk}_{i(n,t)} \to \partial_{jk}g(c_t)$ and $c^n_{i(n,t)} \to c_t$
almost surely if $|i(n,t)\Delta_n - t| \le k_n\Delta_n$, and the above convergence follows by the dominated
convergence theorem, thus ending the proof of (4.28).



4.7 The Second Leading Term. 



In this subsection we prove that F"'^ ^ A^f'"' for all t. Set < = g{P^ + of) - g{/3f ) 



(c,l) 



Y.im(^img{l3i )a^'''"^- A Taylor expansion gives us 



v? = v{l): + v{2)^ + v{3)^, where <^ 



|^;(3)Vl <V(l + ||<llf~^l|aFll^ 



Therefore 



3 [t/An]~kn + l 

V^'^ = ^U{jr, where C/(i)," = ^ v{j)l 

j=i i=i 



(4.33) 



We deduce from Lemma HTHl and (again with /> > sup | J|, so L(n, p) = N*) and from 
\\d'^g{E'l)\\ < K that 



E 



[t/An]-A,'„ + l 

1=1 j,k,l,m 



0. 



Using kn\J it then follows from Riemann approximation (as for ()4.30p above, recall 
that for all t we have /5[^y^ j — Q a.s.) that 



(4.34) 



Next, $v(2)^n_i$ is $\mathcal F^n_{i+k_n}$-measurable and with vanishing $\mathcal F^n_i$-conditional expectation, hence
(as in the case $j = 4$ above)

[t/A„]-fc„+l [t/A„]-fe„+l 
E((C/(2)n') < 2A:„A„ ^ i^(|7;(2)7p) < 2A:„A„ J] i?(||arf ) < i^iA:„A„, 

j=l i=l 




where we have applied (4.20) for the last inequality. Hence $U(2)^n_t \overset{\mathbb P}{\longrightarrow} 0$. Finally (4.20)
yields

[t/A„]-fc„+l 

E(|C/(3)r|) < Ky^, ^((1 + ^ ^*^n'- 

i=l 



Then t/(3)J^ ^ 0, and in view of (|i33D and (0^ we have F"'^ ^ A^''^^ 

4.8 The Third Leading Term. 

Finally we prove that 

yn,5 JP^ (4.35) 

Below, $v^n_i$ denotes the $i$th summand in the definition of $V^{n,5}_t$. We take $\rho \in (0,1]$, and will
eventually let it go to 0. With the sets $L(n,\rho)$ of (4.16), we associate

$$\begin{aligned}
L(n,\rho,t) &= \{1,\dots,[t/\Delta_n]-k_n+1\} \cap L(n,\rho)\\
\overline L(n,\rho,t) &= \{1,\dots,[t/\Delta_n]-k_n+1\} \setminus L(n,\rho).
\end{aligned}$$

We split the sum giving $V^{n,5}_t$ into two terms:

u!/'p = — \= y v^, 17"''' = — \= y 

i&L(;n,p,t) ieL(n,p,t) 

The processes U"''^. We do a Taylor expansion of both g{cs) and around c". 

Recalling /3" = f3f' — c^, we obtain (without mentioning the component indices for simplicity, 
and recalling that Cg and are uniformly bounded):: 

gm - g{cf) - dgic^)Pf - \ d^g{dlW: ®^:\< K\m? 

g{cs) - g{c2) - dg{c2){cs - c^) -^d^g{c^){cs - ® (c. - c^)\ < K\\cs - c^f ). 
It follows that = v{l)1 + ?;(2)7 + ?;(3)7, where 

^n,jklm _-nn,jk-3n,lm r Jk Jk \(Jm 2m \ 

^i,s -Pi Pi %-l)^^)\'^s ^{i-l)llj 

\vm <Kfi:^^;-y^- {\mr + \\cs- c(._i)A„f ) as. 

Therefore 

C/-.P = ^C/(j-)«.P, where 00)"/'' = -^ (4-36) 

J=l i&L(ri,p,t) 

Combining (|4.23|) and Lemma [4. 5t we see that for s G [(i — 1)A„, [i + kn — 1) A„] we have 
E(Cf' I -^r-O - -s + {t- l)A„)c(p)r'^'=''" < ( + ry(p)r). 
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Since 



s + (i - 1)A„) ds 



, and since L(n, p) is ^''-measurable, 



we deduce from Lemma 14.31 and ||5^(7(c")|| < K that 



E( 



k A^/^ 
12~ 



E E 4,;n.5(cr)c(p) 

i£L{n,p,t) j,k,l,m 



0. 



We observe that the number of indices smaller than $[t/\Delta_n] - k_n + 1$ and not in $L(n,\rho,t)$ is at
most $2k_nN^\rho_t$, where $N^\rho_t = \mu((0,t] \times E_\rho)$ is finite; hence, since further $k_n\sqrt{\Delta_n} \to \theta$, the sum
inside the expectation above is a Riemann sum, and we deduce that



^'-■=-Y2 ^ fd%^in.9{cs)c{p)f''^ds. (4.37) 

j,k,l,m " 



Next, f (2)" is -measurable and with vanishing ^^"'''-conditional expectation, hence 

< Kt(<i)p+^+^y 

where we have applied (j4.14p . (j4.17p and (j4.2ip for the last inequality. In the same way, we 
get 

E(|C/(3)ri)<ift(0p + Ay4 + ^). 
Since $\phi_\rho \to 0$ as $\rho \to 0$, we readily deduce from the last two inequalities that

$$\lim_{\rho\to0}\ \limsup_{n\to\infty}\ \mathbb E\big(|U(2)^{n,\rho}_t| + |U(3)^{n,\rho}_t|\big) = 0. \tag{4.38}$$



The processes $\overline U^{n,\rho}$. We will use here the jump times $S^\rho_1, S^\rho_2, \dots$ of the Poisson process
$\mu((0,t] \times E_\rho)$, and will restrict our attention to the set $\Omega_{n,t,\rho}$ defined before (4.13), whose
probability goes to 1 as $n \to \infty$. On this set, $\overline L(n,\rho,t)$ is the collection of all integers $i$ which
are between $[S^\rho_q/\Delta_n] - 2k_n + 2$ and $[S^\rho_q/\Delta_n] + 1$, for some $q$ between 1 and $N^\rho_t$. Thus

_ < ^ [5,7A„]+1 

<?=! " i=lS^/A„]-2k„+2 
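
The description of the excluded indices around one jump time can be checked on a small example. The sketch below assumes that an index $i$ is excluded exactly when the window $((i-1)\Delta_n, (i+2k_n-1)\Delta_n]$ (the interval $I(n,i)$ from the proof of Lemma 4.3) contains the jump time; with a different right endpoint the first excluded index would shift accordingly. The function name and the numerical values are hypothetical.

```python
def excluded_indices(jump_time, delta, k):
    """Indices i >= 1 whose window ((i-1)*delta, (i+2k-1)*delta] contains jump_time
    (assumed form of the window)."""
    upper = int(jump_time / delta) + 2   # beyond this, (i-1)*delta >= jump_time
    return [i for i in range(1, upper + 1)
            if (i - 1) * delta < jump_time <= (i + 2 * k - 1) * delta]

delta = 2.0 ** -10      # dyadic mesh, so grid points are exact in floating point
k = 8
jump_time = 0.37        # not a grid point
a = int(jump_time / delta)
indices = excluded_indices(jump_time, delta, k)
print(indices[0] == a - 2 * k + 2, indices[-1] == a + 1, len(indices) == 2 * k)
```

Under this assumption each jump removes $2k_n$ consecutive indices, namely those between $[S/\Delta_n] - 2k_n + 2$ and $[S/\Delta_n] + 1$.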

The behavior of each $H(n,\rho,q)$ is a pathwise question. We fix $q$ and set $S = S^\rho_q$ and
$a_n = [S/\Delta_n]$. Since $S$ has a density, we can assume without loss of generality that $S > a_n\Delta_n$
for all $n$. We have $v^n_i = v'^n_i - v''^n_i$, where
v'r = knA^[g{(3r) - EjkdM^?)^?''' 
We will use the cadlag property of $c$. If $a_n - 2k_n + 2 \le i \le a_n - k_n$ we have $c^n_i \to c_{S-}$
and /3" — t- C5_, and also Cg — t- cs_^ uniformly in s in the interval [{i — 1)A„, (i + kn — 1)A„], 
so , \ vf — )■ is obvious. 

Next we let Un-kn + l < i < a„ + l. We have c" cs-, and /3"-cg- - ^"~""+^ Acg 
is a simple consequence of the definition of /3", hence 

^ - 5 (cs- + ^" " + ' Acs) - E 5.fc5(c5, ) (4- + ^" " ^ ' ^^') ^ 0- 



Analogously, splitting the integration interval in the definition of u^'" into [{i — 1)A„, 5) and 
[S, {i + kn — 1)A„], we see that 

7— 5(cs-) 7 9{cs) - 2^ 5,fc5(c5_ ) ( + Acs ) ^ 0, 

and it follows that 

7— T- -g[CS- + 7 Acs j + —7 ff(c5-) + 7 <7(C5) ^ 0. 

Putting together the behavior of , \ vf for the sets of possibilities for $i$, as specified
above, we easily deduce

'Sn-l / / II 

H{n, p, q) - [9{cs,- + jr ^^5,) - (l - Tr)9icSg-) - jr 9{cs,- + Acs, 

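and by Riemann integration this yields

$$H(n,\rho,q) \to \theta\int_0^1\big(g(c_{S_q-} + w\,\Delta c_{S_q}) - (1-w)\,g(c_{S_q-}) - w\,g(c_{S_q-} + \Delta c_{S_q})\big)\,dw,$$

which is indeed $\theta\,G(c_{S_q-}, \Delta c_{S_q})$. Henceforth, we have

$$\overline U^{n,\rho}_t \overset{\mathbb P}{\longrightarrow} \overline U^\rho_t = \theta\sum_{q=1}^{N^\rho_t} G(c_{S^\rho_q-}, \Delta c_{S^\rho_q}). \tag{4.39}$$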
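
The passage from the discrete average over the window to an integral over $w \in [0,1]$ is an ordinary Riemann-sum limit. The sketch below illustrates it in dimension one, assuming the integrand $g(x + wy) - (1-w)g(x) - w\,g(x+y)$ read off from the limit above and the test function $g(x) = x^2$, both chosen only for illustration. For this $g$ the integral equals $-y^2/6$.

```python
def riemann_sum(g, x, y, k):
    """(1/k) * sum_{u<k} [g(x + w*y) - (1 - w)*g(x) - w*g(x + y)] with w = u/k."""
    total = 0.0
    for u in range(k):
        w = u / k
        total += g(x + w * y) - (1.0 - w) * g(x) - w * g(x + y)
    return total / k

def square(x):
    return x * x

x, y = 1.5, 0.7              # hypothetical pre-jump value and jump size
limit = -y ** 2 / 6.0        # exact integral for g(x) = x^2
for k in (10, 100, 1000):
    print(k, riemann_sum(square, x, y, k), limit)
```

The integrand vanishes at $w = 0$ and $w = 1$, so it measures how far $g$ is from its chord across the jump, and it is identically zero when $g$ is affine.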



Proof of (4.35). On the one hand, $V^{n,5}_t = U(1)^{n,\rho}_t + U(2)^{n,\rho}_t + U(3)^{n,\rho}_t + \overline U^{n,\rho}_t$; on the
other hand, the dominated convergence theorem (observe that $\tilde c(\rho)_t \to \tilde c_t$ for all $t$) yields
that $U^\rho_t \to A^3_t$ and $\overline U^\rho_t \to A^4_t$ as $\rho \to 0$. Then the property (4.35) follows from (4.37),
(4.38) and (4.39). This completes the proof of Theorem 3.1.

4.9 Proof of Theorem 3.4

The proof of Theorem 3.4 follows the same lines as in Subsection 4.8, and we begin with an
auxiliary step.
Step 1) Replacing by c^". The summands in the definition (I3.14p of A"'"^'^ are
^(cr,cf+fc„), where R{x,y) = J2j,k,i,mdjkM3(^)(y^'' " x^^)iy^"' - x^"'), and we set 

1=1 

We prove here that 

for all $t$, and this is done as in the step $j = 1$ in Subsection 4.5. The function R is on
Rl with \\d^R{x,y)\\ < K{1 + ||x|| + \\y\\)P^^ for j = 0, 1, by 1^. Thus 

Then, exactly as in the aforementioned case, we conclude (4.40), and it remains to prove
that, for all $t$, we have



Step 2) From now on we use the same notation as in Subsection 4.8, although they denote
different variables or processes. For any p G (0, 1] we have A'^''^''^ = — 



where 



E ^r, 

i(iL{n,p,t) 



The decomposition (I4.36P takes the form 



U'^'P = ELi U{jY'P, where U{j) 



n,p 



^(2)r = I T^UAm dkkm 9{C-) iT'' if"^ - 

v{2,)^ = v^-v{l)^-v{2)^. 



^ EieL(n,p,t) and 



Use c!' 



+ and (j3.6p and a Taylor expansion to check that 



b(3)ri<i^llT 



" l|2 niT^" II I 



We also have |v(2)^| < i^rhff (1 + Ha^H + 
Holder's inequality, yield 



)(i + ii«rii + ii/3riir'- 

hence (fOO]) . (ICTI) and (OHD . plus 



E(i^;(3)ri I on + E(b(2)rp i g") < u, + + 



A, 

and thus (4.38) holds here as well. Moreover, (4.25) again yields (4.37), with now



1 



j,k,l,m 



s , 



ds. 
This goes to —A^f^''^ + ^ A[^^^'^ as p — )■ 0. Therefore, we are left to prove that



p > 



UP, with, as /9 ^ 0, C/f A -A'}'^\ 



(4.41) 



Step 3) Now we turn to $\overline U^{n,\rho}$, which on the set $\Omega_{n,t,\rho}$ is, as before,



Nf 



[S,7A„]+1 



^'t''' = ^ H{n, p, q), where H{n, p, 

9=1 



j=[5j/A„]-2fc„+2 



and we set 



[5,7A„] + 1 



j,k,l,m 



= [S^/A„]-2fcn+2 



We use cl" = /3" + a!^ and < and ([SSD to obtain 



K - v?\ < K{1 + ||ar+,J| + (||ar+,J| + iiani), 

yielding — vf] \ G^) < KAI/^ by (I4.20p and Holder's inequality. Therefore, as n — ?• oo. 



9=1 



< 



< 



< 



/a: 



9=li=[5,7A„]-2fc„ 
Aff [5^/A„]+l 

^(»=(E E 

9=1 i=[5^/A„]-2fc„ 



\v?-v?\ 



v?-v7\\G^ 



KkfiAn 



E.{Nf) 0. 



(4.42) 



We are thus left with studying the behavior of the variables $H(n,\rho,q)$, and we do as in
the previous subsection. Again, $a_n = [S/\Delta_n]$. If $a_n - 2k_n + 2 \le i \le a_n - k_n$ (case 1), we
have /3f cs- and /3^_^,^^ - cs- - Acs ^ 0, whereas if - fc^ + 1 < « < a„ + 1 



(case 2) we have ~^ '^•S' ^'^'^ /^I^ ~ '^•S'- 

(2fc„-a„+»)^ V /52 / ^Ar''''Ar''" 



Acs- — ^ 0. Hence IJ^ - 



0, where 



n—T-) o2 / 



in case 1 

C5 I /.^c^. ZAc^'" in case 2. 



We then deduce, by Riemann integration, that 
H{n,p, q) 



■^l E [\d%^i„,g{cs^-) + d%^in.9{cs,- + {l-w)Acs^))w'Ac^,lAcf^dw, 

j,k,l,ni''^ 

which is $\theta\,G'(c_{S_q-}, \Delta c_{S_q})$. Combining this with (4.42), we obtain the first part of (4.41),
with $\overline U^\rho_t = \theta\sum_{q=1}^{N^\rho_t} G'(c_{S^\rho_q-}, \Delta c_{S^\rho_q})$, which in turn satisfies the second part of (4.41) by the
dominated convergence theorem. The proof of Theorem 3.4 is thus complete.
4.10 Proof of Theorem 3.5

The proof is once more somewhat similar to the proof of Subsection 4.8, although the way
we replace by c'j" and further by + /3" is different. 

1) Preliminaries. The $j$th summand in (3.21) involves several estimators $\hat c^n_i$, spanning the
time interval $((j-3)k_n\Delta_n, (j+2)k_n\Delta_n]$. It is thus convenient to replace the sets $L(n,\rho)$,
$L(n,\rho,t)$ and $\overline L(n,\rho,t)$, for $\rho, t > 0$, by the following ones:

$$\begin{aligned}
L'(n,\rho) &= \{j = 3,4,\dots : \mu(((j-3)k_n\Delta_n, (j+2)k_n\Delta_n] \times E_\rho) = 0\}\\
L'(n,\rho,t) &= \{3,\dots,[t/k_n\Delta_n]-3\} \cap L'(n,\rho)\\
\overline L'(n,\rho,t) &= \{3,\dots,[t/k_n\Delta_n]-3\} \cap (\mathbb N \setminus L'(n,\rho)).
\end{aligned}$$

For any $\rho \in (0,1]$ we write $V(F)^n_t = V^{n,\rho}_t + \overline V^{n,\rho}_t$, where

n — 2^jeL'{n,p,t) ' '^t — l^jeL'{n,p,t) '"j ' 

We also set 

xn-y _ c/n _ c/n m a _ an _ Rn 

"j'^ -'^jk„+l C(j-2)fc„ + l' ''jP-Pjkn+1 P(j-2)fc„+l 



(|iA2]) and the last part of (jiT9|) yield 

g > 1 ^ IE(«)^) < J^g A(,25--)-+i-9, E((u;f)«) < Kg. (4.43) 

Observe that J"?? is analogous to 7", with a doubled time lag, so it satisfies a version of 
(j4.25p and, for g > 2, we have 

^9/2 



€L'{n,p) ^ E(||57c'r <^^^'^(^'Ap + ^^^' + ^)- (4-44) 



2) The processes V"'''. (|3:22]l yields 

< K{i + iicf,_3).„+jr^ ii^jcf i|p.^i><} + 

Thus a (tedious) computation shows that, with the notation 



with V > arbitrary, we have |fj | < K{a^ + ||5"c'||^ + a'p) (with if depending on v) 
Therefore we have | V"'''| < if(S"'^ + C"''' + Df), where 



jeL'{n,p,t) jeL'{n,p,t) j=3 
First, (j4.43p and Holder's inequality give us K{a'J^) < Kq^^An'''^'^ for any q > 1 and v > 0,
where (recalling (j3.7p and (j3.19p for w and w') we have set l{q,v) = - {p{l - 2vo) V 

v{l — 2w + nj')). Upon choosing v small enough and q close enough to 1, and in view of 
(j3.7p . we see that l{q,v) > ^, thus implying 

E(Dn ^ 0. (4.45) 

Next, we deduce from (j4.44p that 

i&L'{n,p,t) 

and thus, since p > 3, 

lim limsup EdC;"'^!) = 0. (4.46) 
The analysis of B^''' is more complicated. We have d^'d = + z'p, where 

m=l 

hence 

a" < 4(1 + ||c(^_3)fc^+i||)P"^ (ll^jf klKWX/^} + ll></4} 
It easily follows that for all ^4 > 1, 

4"'^ < 16B,"'^'V4AP-2S,"''''V^5,"''''^ (4.47) 

where 

II T-"- 11^ 

^r™' = E,.L'(n,p,.) a(l)," = (1 + ll^'c ■-3).„+ilir^ ^ 

«(2)," = wz'rr i{ii.rii></4}, a(3)," = ii?'(^3).„-Mir' ii^rii'- 

On the one hand, (14.19^ and (I4.20p and Holder's inequality yield 

Then, exactly as for (I4.46p . and since tu' < |, we get for any p > 0: 

]E(^n>p>i) ^ 0. (4.48) 

On the other hand, observe that z'J^ = fJ,{p)j, with the notation (j4.26p . and as soon as 
j £ L'{n,p), so Lemma gives us 

lim limsup E(S"'^'^) = 0. (4.49) 

P— ^■O n— !>oo 
Finally, (ITOD shows that E(||z^"||9| | J'J'^2)fc„+i) ^ Kq^p^/K^ for all g > 2 and j G L'{n,p),
whereas c'("_3j^, is -^("_2)fc ^;^-measurable, so (14.140 . (I4.19|) and successive conditioning 
yield E(a(3)^| | Q^') < Kg^py/K^. Then, again as for (liliHD . one obtains 

E{B'^'P'^) < Kpt. (4.50) 

At this stage, we gather (4.45)-(4.50) and obtain, by letting first $n \to \infty$, then $p \to 0$, 
then $A \to \infty$, that 

lim limsup E(|V"'''|) = 0. (4.51) 



3) The processes V"'''. With the previous notation Sj and Nf, and on the set ^n,p,t^ we 
have 

K 2 

W"=E E -lsU^.A.l^r (4.52) 

m=l i=-2 

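This is a finite sum (bounded in n for each $\omega$). Letting $S = S_m$ for m and p fixed and 
$w_n = \frac{S}{k_n\Delta_n} - \left[\frac{S}{k_n\Delta_n}\right]$, we see that for any given $j \in \mathbb{Z}$ the variables 
$\hat c^n_{([S/k_n\Delta_n]+j)k_n+1}$ converge in probability to $c_{S-}$ if $j < 0$ and to $c_S$ if $j > 0$, 
whereas for $j = 0$ we have 

$\hat c^n_{[S/k_n\Delta_n]k_n+1} - w_n c_{S-} - (1 - w_n) c_S \xrightarrow{\mathbb{P}} 0.$ 

This in turn implies 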

j<Oor j>2 ^ jp^^^^^^j^^.c Ao 

By virtue of the definition of -y", and since — )• and also since Wn is almost surely in 
(0,1) and F is continuous and F(x, 0) = 0, one readily deduces that 

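$\gamma^n_{[S/k_n\Delta_n]+j} \xrightarrow{\mathbb{P}} \begin{cases} F(c_{S-}, \Delta c_S) & \text{if } j = 1 \\ 0 & \text{if } j \neq 1. \end{cases}$ 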



Coming back to (4.52), we deduce that 

m=l 

In view of (3.22), an application of the dominated convergence theorem gives convergence to $V(F)_t$. 
Then (3.23) follows from V(F)J* = V"''' + V"'^ and (11311) and (fCT]), and the proof of 
Theorem 3.5 is complete. 



4.11 Proof of Theorem [Ml 

(a) is almost obvious: indeed, $V(g)^n_t$ converges with the rate $1/\sqrt{\Delta_n}$ and is asymptotically 
normal with asymptotic variance $2t\,g'(c)^2 c^2$ ($g'$ is the derivative of $g$). However, since $g$ is 
one-to-one, the model indexed by the new parameter $t\,g(c)$ is regular, and the MLE is $t\,g(\hat c_n)$, 
where $\hat c_n = \frac{1}{\Delta_n [t/\Delta_n]} \sum_{i=1}^{[t/\Delta_n]} (X_{i\Delta_n} - X_{(i-1)\Delta_n})^2$, and clearly $t\,g(\hat c_n)$ has the same 
asymptotic properties as $V(g)^n_t$: this proves the result. 
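
The variance quoted in (a) can be recovered by a short delta-method computation. The lines below are only a sketch, under the assumption (not restated in this section) that the toy model is $X_t = \sqrt{c}\,W_t$ with $c > 0$ constant and that $g$ is differentiable at $c$. The symbols $n = [t/\Delta_n]$ and the i.i.d. standard normal variables $Z_i$ are introduced here for illustration.

```latex
\begin{align*}
\hat c_n &= \frac{1}{n\Delta_n}\sum_{i=1}^{n}\bigl(X_{i\Delta_n}-X_{(i-1)\Delta_n}\bigr)^2
          = \frac{c}{n}\sum_{i=1}^{n} Z_i^2, \qquad n=[t/\Delta_n],\\
\operatorname{Var}(\hat c_n) &= \frac{2c^2}{n} \approx \frac{2c^2\Delta_n}{t},\\
t\,g(\hat c_n)-t\,g(c) &\approx t\,g'(c)\,(\hat c_n-c),\\
\frac{1}{\Delta_n}\operatorname{Var}\bigl(t\,g'(c)\,(\hat c_n-c)\bigr)
  &\approx \frac{t^2 g'(c)^2}{\Delta_n}\cdot\frac{2c^2\Delta_n}{t}
  = 2t\,g'(c)^2c^2 .
\end{align*}
```

The second line uses $\operatorname{Var}(Z_i^2) = 2$, and the last line is the variance of the error normalized by $1/\sqrt{\Delta_n}$.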

(b) is also obvious: the properties of $T^n$ hold for all continuous processes X satisfying 
(A-0). Then, using the toy model of (a), the optimality proved above implies that $tH(c) \geq 
2t\,g'(c)^2 c^2$ for any constant c > 0, that is $H \geq h$. 
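
The bound in (b) can be illustrated numerically. The following Monte Carlo sketch assumes the toy model $X_t = \sqrt{c}\,W_t$ and takes $g(x) = x^2$, so the target is $t c^2$ and the lower bound is $2t\,g'(c)^2 c^2 = 8tc^4$. It compares the plug-in estimator $t\,g(\hat c_n)$ with the classical fourth-power estimator $\frac{1}{3\Delta_n}\sum_i (X_{i\Delta_n} - X_{(i-1)\Delta_n})^4$, which is brought in here only for comparison and is not part of the proof. The function name and all parameter values are hypothetical choices made for the illustration.

```python
import numpy as np


def normalized_error_variances(c=2.0, t=1.0, n=1000, n_paths=4000, seed=0):
    """Monte Carlo variances of (estimator - t*c**2) / sqrt(Delta_n).

    Assumed toy model: X = sqrt(c) * W observed at n regular steps on [0, t].
    Returns (variance for the plug-in t*c_hat**2,
             variance for the fourth-power estimator).
    """
    rng = np.random.default_rng(seed)
    dt = t / n  # plays the role of Delta_n
    # Increments of X: independent N(0, c * dt) variables, one row per path.
    dX = rng.normal(0.0, np.sqrt(c * dt), size=(n_paths, n))

    # Normalized realized variance (MLE of c in the assumed model).
    c_hat = (dX ** 2).sum(axis=1) / t
    plug_in = t * c_hat ** 2

    # Classical fourth-power estimator of t * c**2.
    fourth_power = (dX ** 4).sum(axis=1) / (3.0 * dt)

    target = t * c ** 2
    var_plug = ((plug_in - target) / np.sqrt(dt)).var()
    var_fourth = ((fourth_power - target) / np.sqrt(dt)).var()
    return var_plug, var_fourth


if __name__ == "__main__":
    c, t = 2.0, 1.0
    var_plug, var_fourth = normalized_error_variances(c=c, t=t)
    print("plug-in     :", var_plug, " theory:", 8.0 * t * c ** 4)
    print("fourth power:", var_fourth, " theory:", 32.0 / 3.0 * t * c ** 4)
```

Under these assumptions the plug-in variance should be close to $8tc^4$, while the fourth-power estimator should be close to $\frac{32}{3}tc^4$ (since $\operatorname{Var}(Z^4)/9 = 96/9$ for a standard normal $Z$), which is consistent with an inequality of the form $H \geq h$.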